7h agoMicrosoft AI leveraged Ray to train its new 1-trillion parameter MAI-Thinking-1 Mixture-of-Experts reasoning modelR1 top authorRay managed 30,000 CPU cores to evaluate coding environments.Original postRAray@raydistributedCongratulations to the Microsoft AI team on MAI-Thinking-1! Exciting to see Ray used in multiple parts of frontier-model development. - Fast pre-training recovery via in-job restarts with hot standbys - Async RL orchestration (managing learners, inference servers, rollout workers, and routers, each with distinct placement and fault-tolerance needs) - A two-pool Ray cluster for building and grading SWE environments on 30K CPU cores5:19 PM · Jun 3, 20261 more postRetweeted by Robert Nishihara·7hView on