xffxff

zhou fan

Advance one step a day, like a pawn; no effort is ever wasted.

Beijing, China
Joined July 2017

Megatron-LM

Ongoing research training transformer models at scale

Updated 6/9/2025

verl

veRL: Volcano Engine Reinforcement Learning for LLM

Updated 2/15/2025

rltitan

No description provided.

Updated 2/5/2025

flash-linear-attention

🚀 Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Updated 2/4/2025

flame

🔥 A minimal training framework for scaling FLA models

Updated 2/4/2025