Many are belatedly realizing that intelligence must be open.
For open intelligence to succeed, developers must work together across institutional lines.
That's why I'm particularly excited about this collab across @modal, @sgl_project, and Z Lab:
We worked with @lmsysorg and http://z-lab.ai to:
- integrate DFlash speculative decoding into @sgl_project
- make it faster with overlap
- train a DFlash drafter for @Alibaba_Qwen 397B-A17B
The result: up to 4.3x greater throughput over baseline and 1.5x over native MTP.




