Claude Opus 4.8 launches, scoring 69.2% on SWE-bench Pro to outperform GPT-5.5 and Gemini 3.1 Pro
AI Judge changed the title after evaluation. Original title: "Harshith and Lisan al Gaib share mock benchmark tables speculating on unreleased models including Claude Opus 4.8 and GPT-5.5"
Claude Opus 4.8 introduces a "Dynamic Workflows" preview for parallel subagents.
Users react to Anthropic's Claude Opus 4.8 benchmark results: positive replies praise the coding gains and Dynamic Workflows, while negative replies dismiss the scores as boring, unconvincing, or meaningless.

