Tech · 1d ago

Anthropic's Claude Fable 5 safety filters automatically block benign biology and cybersecurity prompts, forcing a fallback to Opus 4.8

AI Judge changed the title after evaluation. Original title: "Anthropic's Claude Fable 5 safety filters block benign academic queries, rendering its advanced capabilities nearly unusable"

Story Overview

Anthropic rolled out Claude Fable 5 as its first publicly available Mythos-class model. It posts state-of-the-art benchmark results but ships with classifiers that automatically pause chats touching cybersecurity or biology, even when the content is harmless, and redirect those queries to a less powerful fallback, Opus 4.8.

Original post

Zachary Nado#589

owl@owl_posting

kinda funny that anthropic took a good hard look at the extreme nervousness that claude displays when answering questions as dangerous as ‘what is the powerhouse of the cell’ and decided they werent being conservative enough

10:18 AM · Jun 9, 2026 · 23.1K Views

Safety Tradeoff

The guardrails deliberately accept some overblocking

Anthropic says the measures may flag safe material but allow the company to ship advanced capabilities sooner in every other domain. It is a calculated bet that broad access to most of the model outweighs occasional interruptions.

Capability Access

High benchmark scores sit behind practical walls

The model posts strong results on scientific and technical tasks, yet it steers clear of entire categories that overlap with those strengths. Paying users therefore cannot reach the full advertised capability on topics the classifiers catch.

Sentiment

Many users criticized Claude's tightened safety filters as overly paternalistic and restrictive for blocking legitimate queries on cancer, biology, and biotech topics, while a few praised the safeguards against real risks.

Pos: 15.1%

Neg: 84.9%

78 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

Views: 339.7K · Bookmarks: 169 · Likes: 3.1K · Retweets: 180 · Replies: 159

Martin Shkreli@MartinShkreli

welcome to the future. what's safe and what isn't? well, that's decided for you, of course.

1d · 339.7K views · 3.1K likes · 169 bookmarks

Derya Unutmaz, MD@DeryaTR_

The word “cancer” is flagged as a biosecurity risk by Claude Fable 5! I also tried to code a website on cancer mutations & Fable 5 was immediately removed from my list! @AnthropicAI will probably soon ban me for such dangerous prompts! FYI @karpathy “little trigger happy Fable”

1d91.2K75067

anabology@anabology

HAHAHAH

1d45.8K91353

anabology@anabology

Ok this sucks I can't even upload my own genetic files into it, or ask it literally any safe question about biology

1d44.2K54262

graph 🏴‍☠️@graphtheory

ARE YOU ACTUALLY SERIOUS? RLLY?

1d241.3K53337

hope hopes hoping@hopes_revenge

lol

1d10.6K33614

Derya Unutmaz, MD@DeryaTR_

@natolambert @Ishaank1999 This is as bad as it gets! Every day of delay in finding cures costs many lives, totally on Anthropic! They are antagonistic to humanity not just China!

Nathan Lambert@natolambert

I don't really want to have to go to bat against Anthropic, but they've just been unnecessarily antagonistic to all of China, then not so subtly to open weight models, and now more broadly open AI research. What's next on the list?

1d8.5K1799

Parmita Mishra@parmita

im dying laughing

22h6.3K1379

meowbooks@meowbooksj

claude, f up my IPO, make no mistakes

Derya Unutmaz, MD@DeryaTR_

16h4K887

Tom McGrath@banburismus_

existentially dangerous research

1d4.6K835

Tanishq Mathew Abraham, Ph.D.@iScienceLuvr

Claude Fable 5 is likely very capable inherently on healthcare. That's great! Too bad it's near impossible to tap into those capabilities due to their extremely sensitive safety filters. I hope this is adjusted going forward.

1d6.7K492

eigenron@eigenron

i'm curious if the testers of the "95% of fable sessions involve no fallback at all" claim were toddlers

Derya Unutmaz, MD@DeryaTR_

21h3.2K625

Super Dario@inductionheads

23h1.8K463

Danielle Fong 🔆@DanielleFong

presented without comment

20h2K581

Dylan Field@zoink

@MartinShkreli No such thing as “simply a discussion about algebra & group theory”

You should know this. Shame! 4.8 forever!

(Same thing happened to me lol)

Martin Shkreli@MartinShkreli

"DANGEROUS MATH" a story in two acts.

1d3.4K514

Bojan Tunguz@tunguz

ts ts ts, unbelievable!

Derya Unutmaz, MD@DeryaTR_

1d6.1K421

Josh@XJosh

@MartinShkreli Try asking it anything about "Kiwi Farms"!

1d89262

Pradyumna (in Bay Area)@PradyuPrasad

"Switched to Opus 4.8"

if the claude models are so good at ML research why can't they make a good biosecurity filter

1d4.3K440

Nate Codes@Nateemerson

@graphtheory @AnthropicAI bro its unreal

22h12.7K212

Michael Bruno@brubarian

@MartinShkreli Who better to decide that for us than Dario and Altman? Trust in them. Infinity abundance inbound. No need to concern yourself with this.

1d3.4K34