SGLang lead developer Banghua Zhu argues decoupling training from rollout servers is the correct abstraction for scaling agentic RL workloads · Digg