Cognition integrates Claude Fable 5 into Devin, topping its new FrontierCode benchmark with a 46.3 percent score
Story Overview
Anthropic's Claude Fable 5 variant has landed inside Cognition's Devin agent and taken the lead on the company's new FrontierCode benchmark, which grades models on whether their code actually merges cleanly into real projects instead of just passing isolated tests.
Coding agents get a quality nudge
The integration lets Devin users tap Fable 5 for engineering tasks right now, though rollout details and exact performance numbers remain tied to Cognition's announcement.
New eval raises fresh comparison questions
FrontierCode's emphasis on mergeability and overall quality is still early, so it is unclear how widely teams will adopt it versus existing coding benchmarks.
Many users celebrated Claude Fable 5's large benchmark lead after Devin integration for its doubled scores and widening gaps over rivals, while others dismissed the results as self-serving since Cognition designed the test.
Most Activity
Took 1 day for AI to 2x the score on the hardest programming benchmark ever made
@NickADobos I hope that it gets solved in the next 6 months and then we can move on to even more challenging tasks!
A new top scorer just one day after our benchmark released! Especially strong on the hardest tasks: 13.4% -> 29.3% on FrontierCode Diamond compared to Opus 4.8.
Claude Fable 5 is now available in Devin.
Fable 5 earns the #1 spot on FrontierCode, our benchmark for real-world engineering tasks that grades mergeability and quality:

You can try Claude Fable 5 as part of Devin Cloud’s Ultra agent. Devin Ultra is our smartest and most capable agent, which excels at long-horizon tasks and debugging.
We tuned the harness so Ultra costs only ~40% more than default Devin agent.
Claude Fable 5 is also available in Devin Desktop and Devin CLI.

Try out Devin today! http://devin.ai

@cognition 🔥

@cognition OpenAI needs to cook something big now to compete 💀

Read our blog post: https://devin.ai/blog/claude-fable-5-available-in-devin

@cognition @noahzweben I’m scared to try it, run out of my Devin max creds immediately as is 👀

@cognition Are we already gonna get saturated 😭

@NickADobos The benchmark creators obviously had pre-release Fable access, so it seems it was purposefully published a day ago for such an effect

@cognition Gaslighting benchmarks. Real world 5.5 mogs 4.8.

@cognition LETS GOO

@ScottWu46 wonder where next OAI model will land

@cognition when are you guys testing the minimax M3?

@cognition @noahzweben bro you can't be serious about Fable scoring almost x2 of gpt 5.5

@cognition https://cognition.ai/blog/frontier-code

@cognition Mirror, Mirror On the Wall, Who is the Mightiest Model of Them All...

@cognition @devarbol definitely not a bubble. Another step change in capability.

@cognition Finally, Devin's user will get a taste of Fable 5

@cognition 💪🏽