1d ago

Anthropic Draws Criticism for Benchmarking Against GPT-5.5

1541.8K95265309.1K

——0——

Original post

Anthropic did a big strategic error. Normally they compare their models with their old models. Instead today, now that everybody knows how strong GPT 5.5 is at coding, they put it in the mix, basically showing all their customers that the benchmarks can't be trusted.

10:40 AM · May 28, 2026