Tech · 1d ago

Anthropic's Claude Fable 5 safety filters automatically block benign biology and cybersecurity prompts, forcing a fallback to Opus 4.8

AI Judge changed the title after evaluation. Original title: "Anthropic's Claude Fable 5 safety filters block benign academic queries, rendering its advanced capabilities nearly unusable"

Story Overview

Anthropic rolled out Claude Fable 5 as its first publicly available Mythos-class model. It pairs state-of-the-art benchmark performance with built-in classifiers that automatically pause chats on cybersecurity or biology topics, even when the content is harmless, and redirect those queries to a less powerful fallback, Opus 4.8.

Original post · Zachary Nado · #589
owl @owl_posting

kinda funny that anthropic took a good hard look at the extreme nervousness that claude displays when answering questions as dangerous as ‘what is the powerhouse of the cell’ and decided they werent being conservative enough

10:18 AM · Jun 9, 2026 · 23.1K Views
Safety Tradeoff

The guardrails deliberately accept some overblocking

Anthropic says the measures may flag safe material but let the company ship advanced capabilities sooner in every other domain. It is a calculated bet that wider access to most of the model outweighs occasional interruptions.

Capability Access

High benchmark scores sit behind practical walls

The model posts strong results on scientific and technical tasks yet steers clear of entire categories that overlap with those strengths, so paid users still cannot tap the full advertised power on topics the classifiers catch.

Sentiment

Many users criticized Claude's tightened safety filters as overly paternalistic and restrictive for blocking legitimate queries on cancer, biology, and biotech topics, while a few praised the safeguards against real risks.

Pos: 15.1%
Neg: 84.9%
78 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Views 339.7K · Bookmarks 169 · Likes 3.1K · Retweets 180 · Replies 159
Martin Shkreli @MartinShkreli

welcome to the future. what's safe and what isn't? well, that's decided for you, of course.

1d · Views 339.7K · Likes 3.1K · Bookmarks 169

The word “cancer” is flagged as a biosecurity risk by Claude Fable 5! I also tried to code a website on cancer mutations & Fable 5 was immediately removed from my list! @AnthropicAI will probably soon ban me for such dangerous prompts! FYI @karpathy “little trigger happy Fable”

1d · Views 91.2K · Likes 750 · Bookmarks 67
anabology @anabology

HAHAHAH

1d · Views 45.8K · Likes 913 · Bookmarks 53
anabology @anabology

Ok this sucks I can't even upload my own genetic files into it, or ask it literally any safe question about biology

1d · Views 44.2K · Likes 542 · Bookmarks 62

ARE YOU ACTUALLY SERIOUS? RLLY?

1d · Views 241.3K · Likes 533 · Bookmarks 37

@natolambert @Ishaank1999 This is as bad as it gets! Every day of delay in finding cures costs many lives, totally on Anthropic! They are antagonistic to humanity not just China!

Nathan Lambert @natolambert

I don't really want to have to go to bat against Anthropic, but they've just been unnecessarily antagonistic to all of China, then not so subtly to open weight models, and now more broadly open AI research. What's next on the list?

1d · Views 8.5K · Likes 179 · Bookmarks 9

im dying laughing

22h · Views 6.3K · Likes 137 · Bookmarks 9
meowbooks @meowbooksj

claude, f up my IPO, make no mistakes

16h · Views 4K · Likes 88 · Bookmarks 7
Tom McGrath @banburismus_

existentially dangerous research

1d · Views 4.6K · Likes 83 · Bookmarks 5

Claude Fable 5 is likely very capable inherently on healthcare. That's great! Too bad it's near impossible to tap into those capabilities due to their extremely sensitive safety filters. I hope this is adjusted going forward.

1d · Views 6.7K · Likes 49 · Bookmarks 2
eigenron @eigenron

i'm curious if the testers of the "95% of fable sessions involve no fallback at all" claim were toddlers

21h · Views 3.2K · Likes 62 · Bookmarks 5
Super Dario @inductionheads
23h · Views 1.8K · Likes 46 · Bookmarks 3

@MartinShkreli No such thing as “simply a discussion about algebra & group theory”

You should know this. Shame! 4.8 forever!

(Same thing happened to me lol)

Martin Shkreli @MartinShkreli

"DANGEROUS MATH" a story in two acts.

1d · Views 3.4K · Likes 51 · Bookmarks 4

ts ts ts, unbelievable!

1d · Views 6.1K · Likes 42 · Bookmarks 1
Josh @XJosh

@MartinShkreli Try asking it anything about "Kiwi Farms"!

1d · Views 892 · Likes 62

"Switched to Opus 4.8"

if the claude models are so good at ML research why can't they make a good biosecurity filter

1d · Views 4.3K · Likes 44 · Bookmarks 0
Nate Codes @Nateemerson

@graphtheory @AnthropicAI bro its unreal

22h · Views 12.7K · Likes 21 · Bookmarks 2
Michael Bruno @brubarian

@MartinShkreli Who better to decide that for us than Dario and Altman? Trust in them. Infinity abundance inbound. No need to concern yourself with this.

1d · Views 3.4K · Likes 34