/AI6h ago

Claude Opus 4.8 scores 1.5% on ARC-AGI-3, tripling the previous record set by GPT-5.5

Greg Kamradt evaluated the reasoning logs using AmpCode agents.

--0--
ARC Prize@arcprize

Anthropic Opus 4.8 is new SOTA on ARC-AGI-3

Score: 1.5%, ~$10K

ARC-AGI-3 analysis notes: * Opus 4.8 read the environment an abstraction *above* Opus 4.7, as objects & systems, not pictures * Opus 4.8 succeeded on early levels, but still committed to a wrong sub-goal

11:15 AM · Jun 1, 2026 · 65.3K Views
Sentiment
Sentiment unavailable for this story.
Cluster Engagement
-
Views
-
Comments
-
Reposts
-
Bookmarks
Expand data
Posts from X
Most Activity
Most ActivityTimeline
VIEWS82KBOOKMARKS140LIKES1.2KRETWEETS29REPLIES49
Lisan al Gaib@scaling01

Opus 4.8 just broke ARC-AGI-3

it tripled GPT-5.5's score

we are now at a breathtaking 1.5% human efficiency

6hViews 82KLikes 1.2KBookmarks 140
Claude Opus 4.8 scores 1.5% on ARC-AGI-3, tripling the previous record set by GPT-5.5 · Digg