Ongoing research training transformer models at scale
veRL: Volcano Engine Reinforcement Learning for LLMs
🚀 Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
🔥 A minimal training framework for scaling FLA models