DR Tulu is now accepted for an oral presentation at #ICML2026 🙏
Updated paper: https://arxiv.org/abs/2511.19399 📥 We added more ablations, including using Qwen3-8B as the rubric generator and judge (showing that evolving rubrics work with a weak model too), a spurious-rewards sanity check, and more.
Live demo: https://www.dr-tulu.org/ Code & models: https://github.com/rlresearch/dr-tulu
Happy to share that DR Tulu has been accepted to ICML as a ✨Spotlight✨!
We believe that co-evolving the agent and its reward metric can lead to more capable intelligence.
DR Tulu is a team effort. Huge thanks and congrats to all my amazing collaborators and mentors!