kinda funny that anthropic took a good hard look at the extreme nervousness that claude displays when answering questions as dangerous as ‘what is the powerhouse of the cell’ and decided they werent being conservative enough
Anthropic's Claude Fable 5 safety filters automatically block benign biology and cybersecurity prompts, forcing a fallback to Opus 4.8
AI Judge changed title after evaluation, original title: "Anthropic's Claude Fable 5 safety filters block benign academic queries, rendering its advanced capabilities nearly unusable"
Story Overview
Anthropic rolled out Claude Fable 5 as its first publicly available Mythos-class model. It posts state-of-the-art performance across benchmarks but ships with classifiers that automatically pause chats on cybersecurity or biology topics, even when the content is harmless, and redirect those queries to a less powerful fallback.
The guardrails deliberately accept some overblocking
Anthropic states that the measures may flag safe material but let the company ship advanced capabilities sooner in every other domain. It is a calculated bet that wider access to most of the model outweighs occasional interruptions.
High benchmark scores sit behind practical walls
The model posts strong results on scientific and technical tasks yet steers clear of entire categories that overlap with those strengths, so paid users still cannot tap the full advertised power on topics the classifiers catch.
Many users criticized Claude's tightened safety filters as overly paternalistic and restrictive for blocking legitimate queries on cancer, biology, and biotech topics, while a few praised the safeguards against real risks.
Most Activity
welcome to the future. what's safe and what isn't? well, that's decided for you, of course.
The word “cancer” is flagged as a biosecurity risk by Claude Fable 5! I also tried to code a website on cancer mutations & Fable 5 was immediately removed from my list! @AnthropicAI will probably soon ban me for such dangerous prompts! FYI @karpathy “little trigger happy Fable”
HAHAHAH
Ok this sucks I can't even upload my own genetic files into it, or ask it literally any safe question about biology
ARE YOU ACTUALLY SERIOUS? RLLY?
lol
@natolambert @Ishaank1999 This is as bad as it gets! Every day of delay in finding cures costs many lives, totally on Anthropic! They are antagonistic to humanity not just China!
I don't really want to have to go to bat against Anthropic, but they've just been unnecessarily antagonistic to all of China, then not so subtly to open weight models, and now more broadly open AI research. What's next on the list?
im dying laughing
claude, f up my IPO, make no mistakes
existentially dangerous research
Claude Fable 5 is likely very capable inherently on healthcare. That's great! Too bad it's near impossible to tap into those capabilities due to their extremely sensitive safety filters. I hope this is adjusted going forward.
i'm curious if the testers of the "95% of fable sessions involve no fallback at all" claim were toddlers
presented without comment
@MartinShkreli No such thing as “simply a discussion about algebra & group theory”
You should know this. Shame! 4.8 forever!
(Same thing happened to me lol)
"DANGEROUS MATH" a story in two acts.
ts ts ts, unbelievable!

@MartinShkreli Try asking it anything about "Kiwi Farms"!
"Switched to Opus 4.8"
if the claude models are so good at ML research why can't they make a good biosecurity filter
@graphtheory @AnthropicAI bro its unreal

@MartinShkreli Who better to decide that for us than Dario and Altman? Trust in them. Infinity abundance inbound. No need to concern yourself with this.