Cartesia releases Sonic-3.5 text-to-speech model that takes first place on Artificial Analysis’s Speech Arena leaderboard in both global and open-weights rankings
It follows the Sonic-1 launch from less than two years earlier.
Extremely proud of the team @cartesia_ai for launching Sonic 3.5, which sets a new state of the art for TTS
I personally led the technical direction of this model; we built it ground up from first principles, and it contains multiple non-trivial ideas that differ substantially from anything we’ve seen in the literature. It’s been very gratifying to see research bets play out and the strong research team at Cartesia continue to grow!
Our new speech model Sonic-3.5 is now #1 on Artificial Analysis's leaderboard.
Less than 2 years ago, we released Sonic-1, the fastest speech model in the world.
Sonic-3.5 now brings the best speech model for conversation with the lowest latency in production.