4h ago

Cerebras places Kimi K2.6, a trillion-parameter model, into enterprise trials running at roughly 1,000 output tokens per second, the highest speed Artificial Analysis has recorded for any frontier model



Benchmarks show 981 output tokens per second on a prompt of 10,000 input tokens.

Original post

#980 @SCALING01 OP

Cerebras @CEREBRAS

Cerebras is now running Kimi K2.6 – a trillion parameter model – in enterprise trials. At ~1,000 tokens/s, this is the fastest frontier model performance ever measured by Artificial Analysis @ArtificialAnlys.

9:44 AM · May 19, 2026

QUOTE POST

#687 Bojan Tunguz @TUNGUZ

Holy guacamole!

Cerebras @cerebras

4:44 PM · May 19, 2026 · 250.1K Views

5:12 PM · May 19, 2026 · 14.3K Views

QUOTE POST

#980 Lisan al Gaib @SCALING01

TPUs are insane

Gemini 3.5 Flash is running at ~867 tokens/s almost as fast as Kimi-K2.6 on Cerebras custom chips

Cerebras @cerebras

4:44 PM · May 19, 2026 · 250.1K Views

5:38 PM · May 19, 2026 · 4.8K Views

