/Tech6h ago

Claude Fable 5 Tops SimpleBench Leaderboard With 81.9% Score

1829992111.9K

#1064

Original post

Lisan al Gaib#1064

JB@JasonBotterill

Fable 5 scores 81.9% on SimpleBench the highest score almost reaching human baseline.

6:23 AM · Jun 10, 2026 · 11.9K Views

/Tech6h ago

Claude Fable 5 Tops SimpleBench Leaderboard With 81.9% Score

1829992111.9K

#1064

Original post

Lisan al Gaib#1064

JB@JasonBotterill

Fable 5 scores 81.9% on SimpleBench the highest score almost reaching human baseline.

6:23 AM · Jun 10, 2026 · 11.9K Views

Sentiment

Positive users value SimpleBench for credibly validating Claude Fable 5's 81.9% score unlike Gemini, while negative users dismiss the benchmark as untrustworthy because Gemini ranks high and the lead is small.

Pos

37.5%

Neg

62.5%

8 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS635LIKES5

Chris@ChrissGPT

@JasonBotterill I knew it wouldn’t be too high but def higher than the rest

5h6355

REPLIES1

47fucb4r8curb4fc8f8r4bfic8r@47fucb4r8c69323

@JasonBotterill HAHAHAHAHA I SEE IT

6h151

JB@JasonBotterill

AI Explained guy says the human baseline is 83.7% finally almost topped the bench after years

6h1262

JB@JasonBotterill

@47fucb4r8c69323 i always think he is wearing a kippah when i look at his profile picture

6h482

47fucb4r8curb4fc8f8r4bfic8r@47fucb4r8c69323

@JasonBotterill underrated benchmark btw

6h761

Vlad G.@vladg_tw

@JasonBotterill Source? There's no new AI explained video and the SimpleBench website has not been updated.

5h2274

𝕱𝖚𝖑𝖑 𝕶𝖊𝖑𝖑𝖞@full_kelly_

@JasonBotterill I'm not gonna lie I failed the bald dude shaving one and I felt pretty bad

5h2342

Fedesco@Fedesco5

@JasonBotterill Any bench that has Gemini 3.1 Pro beating GPT-5.5 is a bench I can't trust.

6h2591

Karl 📚🧮@karlbooklover

@JasonBotterill 2% over gemini 3.1 for a model that will purposefully mislead your research and lie to you, no thanks I'll stick with 3.1

4h1041

ρ:ɡeσn@pigeon__s

@JasonBotterill unlike gemini i actually believe that score is about right

4h159

checked_out@checkfoc_us

@JasonBotterill sure gemini 3.1 pro benches better (but is just worse all round). these benches feel more like stools

6h132

Qwub@Qwubos

@JasonBotterill I wish it was a more lenient model.

I'm sure that if I were to recreate that...I'd get

5h121

Delta Vee@deltaVee42

The average human score was 83.7% and the highest-scoring of the 9 humans tested got 95.4%. (The 95.4% figure is inconsistent with section 4.2 of the simplebench report, which states each participant answered 25 questions.) So the LLMs are just below average human performance but far below the best humans.

4h71