/AI3h ago

Claude Fable 5 Tops SimpleBench Leaderboard With 81.9% Score

152395148.8K
Original postLisan al Gaib#975
JB@JasonBotterill

Fable 5 scores 81.9% on SimpleBench the highest score almost reaching human baseline.

6:23 AM · Jun 10, 2026 · 8.8K Views
Sentiment

Positive users value SimpleBench for credibly validating Claude Fable 5's 81.9% score unlike Gemini, while negative users dismiss the benchmark as untrustworthy because Gemini ranks high and the lead is small.

Pos
37.5%
Neg
62.5%
8 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS635LIKES5
Chris@ChrissGPT

@JasonBotterill I knew it wouldn’t be too high but def higher than the rest

2hViews 635Likes 5
JB@JasonBotterill

AI Explained guy says the human baseline is 83.7% finally almost topped the bench after years

3hViews 126Likes 2
JB@JasonBotterill

@47fucb4r8c69323 i always think he is wearing a kippah when i look at his profile picture

3hViews 48Likes 2
Vlad G.@vladg_tw

@JasonBotterill Source? There's no new AI explained video and the SimpleBench website has not been updated.

2hViews 227Likes 4
Fedesco@Fedesco5

@JasonBotterill Any bench that has Gemini 3.1 Pro beating GPT-5.5 is a bench I can't trust.

3hViews 259Likes 1
Karl 📚🧮@karlbooklover

@JasonBotterill 2% over gemini 3.1 for a model that will purposefully mislead your research and lie to you, no thanks I'll stick with 3.1

59mViews 104Likes 1
ρ:ɡeσn@pigeon__s

@JasonBotterill unlike gemini i actually believe that score is about right

1hViews 159
checked_out@checkfoc_us

@JasonBotterill sure gemini 3.1 pro benches better (but is just worse all round). these benches feel more like stools

3hViews 132
Qwub@Qwubos

@JasonBotterill I wish it was a more lenient model.

I'm sure that if I were to recreate that...I'd get

2hViews 121
Delta Vee@deltaVee42

The average human score was 83.7% and the highest-scoring of the 9 humans tested got 95.4%. (The 95.4% figure is inconsistent with section 4.2 of the simplebench report, which states each participant answered 25 questions.) So the LLMs are just below average human performance but far below the best humans.

1hViews 71
zuphr1n@zuphr1n

@JasonBotterill You wanna verify if it was actually fable or fallback 4.8

2hViews 60
cjekrjgrw@rzastyyy

@JasonBotterill @scaling01 GEMINI in second place 😂😂😂

2hViews 58
JB@JasonBotterill

@47fucb4r8c69323 no one fucking believes me thank you😭😭😭

3hViews 11Likes 1
Hamza@thegenioo

@JasonBotterill @R2Cdev_ i mean only 2 points above 3.1 Pro tho

1hViews 16