/Tech1d ago

Cognition integrates Claude Fable 5 into Devin, topping its new FrontierCode benchmark with a 46.3 percent score

Story Overview

Anthropic's Claude Fable 5 variant has landed inside Cognition's Devin agent and taken the lead on the company's new FrontierCode benchmark, which grades models on whether their code actually merges cleanly into real projects instead of just passing isolated tests.

1202K120195188.4K

#746

Original post

Super Dario#1958

Cognition@cognition#799inTech

Claude Fable 5 is now available in Devin.

Fable 5 earns the #1 spot on FrontierCode, our benchmark for real-world engineering tasks that grades mergeability and quality:

10:25 AM · Jun 9, 2026 · 135.6K Views

/Tech1d ago

Cognition integrates Claude Fable 5 into Devin, topping its new FrontierCode benchmark with a 46.3 percent score

Story Overview

1202K120195188.4K

#746

Original post

Super Dario#1958

Cognition@cognition#799inTech

Claude Fable 5 is now available in Devin.

Fable 5 earns the #1 spot on FrontierCode, our benchmark for real-world engineering tasks that grades mergeability and quality:

10:25 AM · Jun 9, 2026 · 135.6K Views

Developer Impact

Coding agents get a quality nudge

The integration lets Devin users tap Fable 5 for engineering tasks right now, though rollout details and exact performance numbers remain tied to Cognition's announcement.

Open Question

New eval raises fresh comparison questions

FrontierCode's emphasis on mergeability and overall quality is still early, so it is unclear how widely teams will adopt it versus existing coding benchmarks.

Sentiment

Many users celebrated Claude Fable 5's large benchmark lead after Devin integration for its doubled scores and widening gaps over rivals, while others dismissed the results as self-serving since Cognition designed the test.

Pos

60.3%

Neg

39.7%

25 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS31.8KBOOKMARKS28LIKES352

Nick Dobos@NickADobos

Took 1 day for AI to 2x the score on the hardest programming benchmark ever made

Scott Wu@ScottWu46

@NickADobos I hope that it gets solved in the next 6 months and then we can move on to even more challenging tasks!

1d31.8K35228

RETWEETS10REPLIES11

Scott Wu@ScottWu46

A new top scorer just one day after our benchmark released! Especially strong on the hardest tasks: 13.4% -> 29.3% on FrontierCode Diamond compared to Opus 4.8.

Cognition@cognition

Claude Fable 5 is now available in Devin.

Fable 5 earns the #1 spot on FrontierCode, our benchmark for real-world engineering tasks that grades mergeability and quality:

1d14.8K21211

Cognition@cognition

You can try Claude Fable 5 as part of Devin Cloud’s Ultra agent. Devin Ultra is our smartest and most capable agent, which excels at long-horizon tasks and debugging.

We tuned the harness so Ultra costs only ~40% more than default Devin agent.

Claude Fable 5 is also available in Devin Desktop and Devin CLI.

1d3.3K403

Cognition@cognition

Try out Devin today! http://devin.ai

1d1.8K10