6h ago

Cerebras places Kimi K2.6, a trillion-parameter model, into enterprise trials running at roughly 1,000 output tokens per second, the highest speed Artificial Analysis has recorded for any frontier model

Benchmarks show 981 tokens per second on 10,000 input tokens.

Original post

Cerebras is now running Kimi K2.6 – a trillion parameter model – in enterprise trials. At ~1,000 tokens/s, this is the fastest frontier model performance ever measured by Artificial Analysis @ArtificialAnlys.

9:44 AM · May 19, 2026

Holy guacamole!

Cerebras @cerebras

Cerebras is now running Kimi K2.6 – a trillion parameter model – in enterprise trials. At ~1,000 tokens/s, this is the fastest frontier model performance ever measured by Artificial Analysis @ArtificialAnlys.

4:44 PM · May 19, 2026 · 280.2K Views
5:12 PM · May 19, 2026 · 15.2K Views

TPUs are insane

Gemini 3.5 Flash is running at ~867 tokens/s, almost as fast as Kimi K2.6 on Cerebras' custom chips.

5:38 PM · May 19, 2026 · 5.7K Views