Cartesia releases Ink-2, a streaming speech-to-text model that tops the Artificial Analysis leaderboard for accuracy

Built-in semantic endpointing detects when a user has finished speaking

Original post

Cartesia Ink-2 debuts as #1 for accuracy on the brand-new streaming speech-to-text leaderboard from @ArtificialAnlys! We designed Ink-2 from the ground up for voice agents - with low latency, eager transcripts, and semantic endpointing.

9:50 AM · May 28, 2026
Reposted by

Our new model Ink-2 tops AA's leaderboard for streaming speech-to-text!

Ink-2 comes with plenty of features optimized for real-time voice agents. With top-class models for both TTS and STT, the team at @cartesia keeps pushing the frontier of models for interactive intelligence.

Cartesia (@cartesia)

Cartesia Ink-2 debuts as #1 for accuracy on the brand-new streaming speech-to-text leaderboard from @ArtificialAnlys! We designed Ink-2 from the ground up for voice agents - with low latency, eager transcripts, and semantic endpointing.

4:50 PM · May 28, 2026 · 18K Views
5:26 PM · May 28, 2026 · 2.5K Views

Our new speech-to-text model Ink-2 is out and #1 on Artificial Analysis.

It’s built for streaming: low latency, a fast eager mode, and built-in semantic endpointing to detect when users are done talking.

New architectures & algorithms made this Pareto-dominance possible

5:04 PM · May 28, 2026 · 1.5K Views

Last week @cartesia topped the TTS leaderboard; now it's crushing both ends of the STT-TTS sandwich.

6:06 PM · May 28, 2026 · 350 Views