🎉 Meet Higgs Audio v3 TTS from @boson_ai, a ~4B chat-native TTS model for real-time voice agents. Day-0 support is live in SGLang-Omni!
> Low-latency streaming
> 100 languages, single-digit WER/CER
> Zero-shot voice cloning from a short clip
> 20+ inline tokens for emotion, style & SFX
> 14.74 req/s @ RTF 0.262 on 1× H100
👉 Cookbook: http://sgl-project.github.io/sglang-omni/cookbook/higgs_tts.html
Run it now with SGLang-Omni!
Higgs Audio v3 TTS is here.
Built for voice AI that speaks, not just reads:
• 100 languages with single-digit WER/CER
• inline control over emotion, style, prosody, and sound effects
• API, Workspace, and open weights
• Blog 👉 https://www.boson.ai/blog/higgs-audio-v3-tts
Watch the demo 👇

