Microsoft AI details Rocket, an in-house distributed reinforcement learning framework using SGLang to train its MAI-Thinking-1 model · Digg