Gavin Baker, Atreides Management CIO, says Composer 2.5 achieved Pareto dominance over GPT-5.5 medium on CursorBench 3.1

The model scored 63% at $0.50 per task.

Original post

Composer 2.5 being Pareto dominant in coding per CursorBench is important. This is after only a few weeks of supplemental training and/or RL in the Colossus 2 cluster. The 1.5 trillion parameter version of Grok will likely be a much better base model than Kimi. We shall see.

9:35 AM · May 27, 2026

@GavinSBaker Grok 1.5T is trending well

Gavin Baker (@GavinSBaker)

4:35 PM · May 27, 2026 · 209.6K Views
6:38 AM · May 28, 2026 · 144.9K Views