Gavin Baker, Atreides Management CIO, says Composer 2.5 achieved Pareto dominance over GPT-5.5 medium on CursorBench 3.1
Composer 2.5 scored 63% on CursorBench 3.1 at $0.50 per task.
——0——
@GavinSBaker Grok 1.5T is trending well
Composer 2.5 being Pareto dominant in coding per CursorBench is important. This is after only a few weeks of supplemental training and/or RL in the Colossus 2 cluster. The 1.5 trillion parameter version of Grok will likely be a much better base model than Kimi. We shall see.
4:35 PM · May 27, 2026 · 209.6K Views
6:38 AM · May 28, 2026 · 144.9K Views