It's interesting to see @MicrosoftAI uses ray actors not just for controller and rollout workers but problem workers for the posting training of the MAI-Thinking-1 model. Instead of introducing third party dependency like @modal for sandboxing, Ray actors could provide finer granularity and control for heterogeneous compute which could translate to better utilization of the unused CPU resources in the GPU cluster and easier communication of the agent execution results. Also the part of work I did was to support @sgl_project with @raydistributed backend to better support RL infra especially in weight syncing.
Microsoft AI Uses Ray Actors For MAI-Thinking-1 Post-Training
--0--
Original posts
Reposts
Original post
Robert Nishihara#721
Xinyu Zhang@xinyzng
10:31 PM · Jun 2, 2026 · 1.1K Views
Sentiment
Sentiment building, check back later.
Cluster Engagement
-
Views
-
Comments
-
Reposts
-
Bookmarks
Expand data
Posts from X
Most Activity
Most ActivityTimeline
No ranked X posts are available for this story yet.