slime
BIO
The LLM post-training framework for RL Scaling. https://github.com/THUDM/slime
BIO
The LLM post-training framework for RL Scaling. https://github.com/THUDM/slime
Dhruv Batra
@DhruvBatra_
Co-founder & Chief Scientist @yutori_ai. Prev: Senior Director leading FAIR Embodied AI @MetaAI and Professor @GeorgiaTech.