@ethanCaballero and it is warranted
the claude fable 5 nerf for AI research has induced the angriest reaction from AI researchers that I've ever seen in my life
Anthropic released Claude Fable 5 as its strongest generally available model for complex reasoning and agent tasks, complete with Mythos 5 as a less-restricted sibling, yet the built-in classifiers that trigger fallbacks to Opus 4.8 on high-risk prompts have fueled researcher complaints of silent performance drops in biology, AI work, and life sciences even when no refusal occurs.
@ethanCaballero and it is warranted
the claude fable 5 nerf for AI research has induced the angriest reaction from AI researchers that I've ever seen in my life
Users report the model sometimes underperforms prior versions on benign technical queries without warning, prompting Andreas Kirsch to flag premature social panic while others insist the changes feel like intentional steering rather than simple safety switches.
Guillaume Verdon has amplified the frustration, arguing the restrictions slow progress in biotech and ML without clear evidence of risk, leading some subscribers to cancel and explore open alternatives even as the model stays free through mid-June.

@ethanCaballero most probably some form of routing involving thinking token budgets
It's almost on keep4o levels and people haven't even spent time with the model...
Also there is a social panic bc some folks have recklessly started describing the reduction in effectiveness as the model "sabotaging their work" and then others pick it up and understand it as the model introducing bugs and nerfing their experiments intentionally? 🤦
the claude fable 5 nerf for AI research has induced the angriest reaction from AI researchers that I've ever seen in my life
re: Claude Fable 5 intentionally silently nerfs itself when asked to do AI research.
How does the nerf play out in practice? Does Fable 5 intentionally start injecting silent bugs everywhere? or does Fable 5 nerf itself in other way(s)?

@ethanCaballero ends up adding just the right regularization to my codebase, checkmate

@ethanCaballero it just "limits its effectiveness" that's all

@ethanCaballero And how do we know if it's triggering in frontier model adjacent research???

@ethanCaballero Tbh it’s anyway been doing that since before 💁🏼♀️

@ethanCaballero wait so it selectively catches worse code too? because that could just look like normal dev frustration till someone audits it

@adrianhei @ethanCaballero It says steering vector
Anthropic released Claude Fable 5 as its strongest generally available model for complex reasoning and agent tasks, complete with Mythos 5 as a less-restricted sibling, yet the built-in classifiers that trigger fallbacks to Opus 4.8 on high-risk prompts have fueled researcher complaints of silent performance drops in biology, AI work, and life sciences even when no refusal occurs.
@ethanCaballero and it is warranted
the claude fable 5 nerf for AI research has induced the angriest reaction from AI researchers that I've ever seen in my life