/AI3h ago

Claude Fable 5 Tops SimpleBench Leaderboard With 81.9% Score

152395148.8K

#975

Original post

Lisan al Gaib#975

JB@JasonBotterill

Fable 5 scores 81.9% on SimpleBench the highest score almost reaching human baseline.

6:23 AM · Jun 10, 2026 · 8.8K Views

/AI3h ago

Claude Fable 5 Tops SimpleBench Leaderboard With 81.9% Score

152395148.8K

#975

Original post

Lisan al Gaib#975

JB@JasonBotterill

Fable 5 scores 81.9% on SimpleBench the highest score almost reaching human baseline.

6:23 AM · Jun 10, 2026 · 8.8K Views

Sentiment

Positive users value SimpleBench for credibly validating Claude Fable 5's 81.9% score unlike Gemini, while negative users dismiss the benchmark as untrustworthy because Gemini ranks high and the lead is small.

Pos

37.5%

Neg

62.5%

8 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS635LIKES5

Chris@ChrissGPT

@JasonBotterill I knew it wouldn’t be too high but def higher than the rest

2h6355

REPLIES1

47fucb4r8curb4fc8f8r4bfic8r@47fucb4r8c69323

@JasonBotterill HAHAHAHAHA I SEE IT

3h151

JB@JasonBotterill

AI Explained guy says the human baseline is 83.7% finally almost topped the bench after years

3h1262

JB@JasonBotterill

@47fucb4r8c69323 i always think he is wearing a kippah when i look at his profile picture

3h482

47fucb4r8curb4fc8f8r4bfic8r@47fucb4r8c69323

@JasonBotterill underrated benchmark btw

3h761

Vlad G.@vladg_tw

@JasonBotterill Source? There's no new AI explained video and the SimpleBench website has not been updated.

2h2274

𝕱𝖚𝖑𝖑 𝕶𝖊𝖑𝖑𝖞@full_kelly_

@JasonBotterill I'm not gonna lie I failed the bald dude shaving one and I felt pretty bad

2h2342

Fedesco@Fedesco5

@JasonBotterill Any bench that has Gemini 3.1 Pro beating GPT-5.5 is a bench I can't trust.

3h2591

Karl 📚🧮@karlbooklover

@JasonBotterill 2% over gemini 3.1 for a model that will purposefully mislead your research and lie to you, no thanks I'll stick with 3.1

59m1041

ρ:ɡeσn@pigeon__s

@JasonBotterill unlike gemini i actually believe that score is about right

1h159

checked_out@checkfoc_us

@JasonBotterill sure gemini 3.1 pro benches better (but is just worse all round). these benches feel more like stools

3h132

Qwub@Qwubos

@JasonBotterill I wish it was a more lenient model.

I'm sure that if I were to recreate that...I'd get

2h121

Delta Vee@deltaVee42

The average human score was 83.7% and the highest-scoring of the 9 humans tested got 95.4%. (The 95.4% figure is inconsistent with section 4.2 of the simplebench report, which states each participant answered 25 questions.) So the LLMs are just below average human performance but far below the best humans.

1h71