rocke2020

Rocke Dong

No bio provided.

Shanghai
Joined December 2017

verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python · Updated 5/1/2025

trl

Train transformer language models with reinforcement learning.

Python · Updated 4/19/2025

RLHF-exercise

No description provided.

Jupyter Notebook · Updated 4/17/2025

AutoDidact

Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.

Updated 4/16/2025

Search-R1

Search-R1: An efficient, scalable RL training framework for LLMs that interleave reasoning with search engine calls, based on veRL

Python · Updated 4/13/2025