Claude Fable 5.0 sets a new frontier score on runescape bench. y axis is log scale
Claude Fable 5.0 sets a new frontier score on runescape bench. y axis is log scale
Users appreciate updates on Claude Fable 5.0 setting a record on the Runescape AI Benchmark because it is the benchmark they value most.

Claude Fable adeptly navigates the multi-step supply chains needed for smithing and crafting, and makes sharp navigation of tradeoffs when optimizing. For instance, it's the first model I've seen use the bank to store resources

Comparing to Opus 4.8, it's about the same speed in tokens per second, which surprises me, I expected "mythos" to be giant and slow. Plus it tends to be about 20% shorter in its responses than opus! (Still comes out to 2x the cost per run) You can view the results here: https://maxbittker.github.io/runebench/

@maxbittker The only benchmark I care about: thanks for the update 🙏
Claude Fable 5.0 sets a new frontier score on runescape bench. y axis is log scale