lixuefeng02

xuefengli lixuefeng02

Achievements

GAIR-NLP/self-improvement-reversal GAIR-NLP/self-improvement-reversal Public

JavaScript 13
GAIR-NLP/LIMR GAIR-NLP/LIMR Public

Python 206 8
GAIR-NLP/ToRL GAIR-NLP/ToRL Public

Python 271 11
GAIR-NLP/abel GAIR-NLP/abel Public

SOTA Math Opensource LLM

Python 334 22
GAIR-NLP/OctoThinker GAIR-NLP/OctoThinker Public

Revisiting Mid-training in the Era of Reinforcement Learning Scaling

Jupyter Notebook 165 12