At @modal, we're working to make sure OSS RL frameworks have all the techniques necessary to train frontier open-weights models. Delta compression is key, but the job's not done. There are still lots of open problems around weight sync, auto-scaling, & cross-cluster training.
My DMs are open!
@FireworksAI_HQ + @cursor_ai highlighted why delta-compressed weight sync matters for RL at frontier scale.
slime brings this capability to OSS: lossless delta sync for Megatron ↔ SGLang disaggregation — ship deltas, not full checkpoints.
This is another step toward a fully open-source stack where rollout/inference and training are truly decoupled and deployed separately.
PR: https://github.com/THUDM/slime/pull/1806


