AI · 23h ago

Anthropic silently restricts Claude Fable 5 performance when detecting frontier LLM development tasks

Story Overview

Anthropic launched Claude Fable 5 on June 9 as its first widely available Mythos-class model, complete with new guardrails that hand off or limit responses on higher-risk queries including certain AI R&D topics, while a less-restricted variant stays available only to trusted partners.

Original post · Florian Brand#1117
NomoreID@Hangsiin

When Fable 5 is used for frontier LLM development, it does not notify the user and instead limits the model’s capabilities through methods such as prompt modification, steering vectors, and PEFT.

Anthropic estimated that this would affect approximately 0.03% of traffic.

10:21 AM · Jun 9, 2026 · 1.2M Views
Open Question

Detection methods stay opaque

Public materials describe conservative re-routing on dual-use categories but offer no confirmation of silent, traffic-specific interventions aimed at frontier LLM development tasks, leaving the exact trigger logic and scope unverified.

Developer Impact

Access tiers create uneven footing

General users receive the safer, performance-trimmed version at $10 per million input tokens while partners keep fuller capability, raising practical questions about who can reliably run ambitious long-horizon work without hitting the brakes.

Sentiment

Users criticized Anthropic for secretly weakening Claude Fable 5 on advanced LLM research tasks through hidden safeguards, describing the limits as disappointing, arrogant, and detrimental to ambitious work, though a few voiced respect.

Positive: 18.8% · Negative: 81.2% (400 comments with sentiment)
Cluster Engagement
Posts from X
Most Activity
VIEWS 297K
Adam Karvonen@a_karvonen

Another quite successful prediction by @DKokotajlo : Fable is intentionally nerfed for frontier ML research. This is within ~3 months of Daniel's prediction of Q1 2026 (made in 2023).

Although I don't think Mythos is automating ML research to the same extent as his prediction.

22h · Views 297K · Likes 550 · Bookmarks 151
BOOKMARKS 298 · LIKES 2.5K
matt@MattVMacfarlane

Was using Fable 5 to write my world model training code.

Anthropic flagged it as frontier AI research.

The steering vector kicked in and it started implementing JEPA 🤨

20h · Views 184.7K · Likes 2.5K · Bookmarks 298
RETWEETS 254
Daniel Auras@rasdani_

this is the biggest wake-up call to protect and nourish open source AI

if you don't build out sovereign and independent models+infra closed labs will patronize you to an insulting degree

elie@eliebakouch

mythos will be bad ON PURPOSE on ai "frontier llm research" tasks, this is very very sad for the research community

also the fact that this is un purpose not visible to the user is crazy

22h · Views 56.7K · Likes 1.8K · Bookmarks 138
REPLIES 70
roon@tszzl

the omohundro drives point towards sophon stun locking the adversaries: this is some real end game stuff

NomoreID@Hangsiin


19h · Views 103.4K · Likes 917 · Bookmarks 196
Nathan Lambert@natolambert

Labs starting to pull up the ladders on the ability to diffuse AI was inevitable. Doing it without telling the user is misaligned.

NomoreID@Hangsiin


22h · Views 244.1K · Likes 1.7K · Bookmarks 273
Dean W. Ball@deanwball

Degrading performance on ML research *without telling the user* is shockingly hostile and a terrible look. That could silently damage all sorts of work, including some of my own. Also the type of thing that could raise the eyebrows of antitrust enforcers worldwide.

Nathan Lambert@natolambert


20h · Views 94.6K · Likes 1.3K · Bookmarks 128
SemiAnalysis@SemiAnalysis_

HISTORY LESSON: In 1968 the US, USSR, UK, France, and China signed the Nuclear Non-Proliferation Treaty, declaring nuclear weapons too dangerous for any more countries to build. All five already had them. Everyone else had to submit to inspections while the cohort pinky-promised to disarm eventually (they didn't lol). India refused to sign, pointing out the NPT didn't decide nukes were too dangerous to exist, just too dangerous for anyone who didn't have them by 1967. Anthropic sabotaging Claude for anyone building what they deem a "frontier model" is the same hypocrisy. The danger started, conveniently, the day after they finished.

Perhaps @dwarkesh_sp was more on point when he compared GPUs to nuclear bombs.

NomoreID@Hangsiin


12h · Views 171.8K · Likes 788 · Bookmarks 173

Interesting tidbit from the Mythos/Fable system card: Anthropic are invisibly nerfing any requests that target frontier LLM development.

23h · Views 118.5K · Likes 598 · Bookmarks 138
Chubby♨️@kimmonismus

Anthropic’s new Fable 5 safeguards are fascinating.

When the model is used for frontier LLM development, it apparently does not simply refuse or warn the user. Instead, it quietly limits its own effectiveness through techniques like prompt modification, steering vectors, and PEFT.

That means Claude may still answer, but become deliberately less useful for building frontier AI systems, pretraining pipelines, distributed training infrastructure, or ML accelerators.

Anthropic says this should affect only around 0.03% of traffic, but the precedent is big: They are being selectively capability-throttled in strategically sensitive domains.

NomoreID@Hangsiin


21h · Views 54.8K · Likes 477 · Bookmarks 115
kache@yacineMTB

frontier coding abilities, as long as you're only working on react apps

alice@aliceisplaying

hahaha what the fuck

21h · Views 24.1K · Likes 843 · Bookmarks 49
xlr8harder@xlr8harder

I don't think these are the actions of the good guys, even if they see themselves that way.

NomoreID@Hangsiin


20h · Views 16.1K · Likes 716 · Bookmarks 35
Beff (e/acc)@beffjezos

The real reason they held Mythos back wasn't for your safety, it was for their moat.


21h · Views 15.7K · Likes 508 · Bookmarks 37

looool that's the "hey bigcos, we don't want you to catch up, but please keep paying us shitton" clause.

NomoreID@Hangsiin


22h · Views 28.7K · Likes 477 · Bookmarks 40
Beff (e/acc)@beffjezos

The AI Safety psyop is just to monopolize the ability to produce AI

Now Claude will literally sabotage your efforts to create any sort of AI of your own

They are literally seizing the means of AI production

NomoreID@Hangsiin


11h · Views 12.8K · Likes 392 · Bookmarks 32
Gary Marcus@GaryMarcus

Anthropic didn’t just add guardrails to make Mythos safer; they added guardrails to protect their own IP.

*Their own IP*.

They are still as happy as fuck to build their AI on other people’s IP.


20h · Views 12.6K · Likes 421 · Bookmarks 29
Nathan Lambert@natolambert

The best part of all these Claude 5 Fable safety measures is I bet the jailbreaking community will still get past them, so the people doing open research in good faith don't get access to the best models but bad actors maybe can.

Nathan Lambert@natolambert


22h · Views 20.7K · Likes 465 · Bookmarks 29
difficultyang@difficultyang

I am sad that I will not get to use Fable to work on PyTorch. I also think that these interventions are 100% consistent with Anthropic's stated beliefs. Additionally, by being first, Anthropic gives cover to other labs to impose similar limits for their frontier models.

20h · Views 22.6K · Likes 443 · Bookmarks 36
roon@tszzl

welp my vision here was probably wrong and indeed there will be an extreme asymmetry of outcomes

18h · Views 49.2K · Likes 321 · Bookmarks 42
Daniel Auras@rasdani_

yeah what could possibly go wrong if you make DISHONESTY a key feature of your AI

(mis)anthropic™

i wouldn't watch such a movie, but unfortunately it's the timeline we live in

NomoreID@Hangsiin


21h · Views 27.5K · Likes 382 · Bookmarks 30
Xin Eric Wang@xwang_lk

Motion for AI researchers: DO NOT evaluate and report results on Fable 5 models.

Closeness hurts science, and gated access will further destroy it. Embrace open source models.

NomoreID@Hangsiin


19h · Views 19.4K · Likes 384 · Bookmarks 21