1d ago

Gavin Baker, Atreides Management CIO, says Composer 2.5 achieved Pareto dominance over GPT-5.5 medium on CursorBench 3.1

Composer 2.5 scored 63% at $0.50 per task.

Original post

Composer 2.5 being Pareto dominant in coding per CursorBench is important. This is after only a few weeks of supplemental training and/or RL in the Colossus 2 cluster. The 1.5 trillion parameter version of Grok will likely be a much better base model than Kimi. We shall see.

9:35 AM · May 27, 2026

#76 Elon Musk @ELONMUSK

@GavinSBaker Grok 1.5T is trending well

Gavin Baker @GavinSBaker

4:35 PM · May 27, 2026 · 209.6K Views

6:38 AM · May 28, 2026 · 144.9K Views