SGLang lead developer Banghua Zhu says SGLang hit 12,000 tokens per second per GPU running DeepSeek V4 Pro on GB300 NVL72 · Digg