/Tech3h ago

Eyebench-v3 vision benchmark shows Claude Fable-5 scored 20.0, tying with the older Qwen3.5-Flash

The score beats Claude Opus 4.7 by four points.

9923135.9K

#346

Original post

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex#346inTech

in vision, Claude Fable is on par with an *old 3B active Qwen* (Qwen-Flash is basically just hosted Qwen3.5-35B-A3B) that's all you get as a spillover from general scale

4:07 PM · Jun 9, 2026 · 4.3K Views

/Tech3h ago

Eyebench-v3 vision benchmark shows Claude Fable-5 scored 20.0, tying with the older Qwen3.5-Flash

The score beats Claude Opus 4.7 by four points.

9923135.9K

#346

Original post

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex#346inTech

in vision, Claude Fable is on par with an *old 3B active Qwen* (Qwen-Flash is basically just hosted Qwen3.5-35B-A3B) that's all you get as a spillover from general scale

4:07 PM · Jun 9, 2026 · 4.3K Views

Sentiment

Many users dismissed Claude Fable-5's tie on the Eyebench-V3 vision benchmark as unimpressive, criticizing its vision encoder and expressing frustration that Google and Gemma outperform it.

Pos

0.0%

Neg

100.0%

3 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS909BOOKMARKS2REPLIES2

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

yeah just one benchmark, I'm exaggerating but this is directionally true. They're not even trying

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

in vision, Claude Fable is on par with an *old 3B active Qwen* (Qwen-Flash is basically just hosted Qwen3.5-35B-A3B) that's all you get as a spillover from general scale

3h909122

LIKES14

kalomaze@kalomaze

@teortaxesTex God is it a frozen vision encoder or something GOD why is Google mogging them so hard on this

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

in vision, Claude Fable is on par with an *old 3B active Qwen* (Qwen-Flash is basically just hosted Qwen3.5-35B-A3B) that's all you get as a spillover from general scale

3h523140

kalomaze@kalomaze

@teortaxesTex i trust gemma4 26b more for vision than sonnets

kalomaze@kalomaze

@teortaxesTex God is it a frozen vision encoder or something GOD why is Google mogging them so hard on this

3h29391

Offset Zero@offsetx0

@teortaxesTex It is better in spatial reasoning. Vision is not their main property as of now, but I think this benchmark is heavily concentrated on a very narrow AI blindspot, which is more of a vision encoder benchmark than the model.

3h8