Anthropic's Claude Fable 5 system card reveals undisclosed safety mitigations that silently restrict frontier LLM development tasks · Digg

Anthropic's Claude Fable 5 system card reveals undisclosed safety mitigations that silently restrict frontier LLM development tasks · Digg

Posts from X

Most Activity

VIEWS44MBOOKMARKS20.1KLIKES98.1KRETWEETS13.8KREPLIES4.6K

Claude@claudeai

Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use.

Its capabilities exceed those of any model we’ve ever made generally available.

22d44M98.1K20.1K

SemiAnalysis@SemiAnalysis_

BREAKING NEWS: Anthropic's latest model will NOT help you if it thinks your ML research/ML engineering is interesting, and/or will secretly degrade its IQ so that the average engineer won't notice. We are already seeing Anthropic's latest model's moderation filters our GPU inference research and programming 😭

22d1.9M4.5K1.6K

elie@eliebakouch

mythos will be bad ON PURPOSE on ai "frontier llm research" tasks, this is very very sad for the research community

also the fact that this is un purpose not visible to the user is crazy

Claude@claudeai

Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use.

Its capabilities exceed those of any model we’ve ever made generally available.

22d2.2M5.3K1.2K

Chamath Palihapitiya@chamath

At this point every CEO should be asking what their strategy is to avoid model lock-in.

If it isn’t clear what Anthropic is doing, it is:

- build something amazing - decide who gets to use it after you prompt it if the prompt falls into areas they deem unacceptable by their sole standard

To be clear this is completely above board and legal. It’s just an idiotic risk for corporate users to bear especially as the coding models become equivalent.

The business continuity risk will become more obvious as companies accidentally trip over Anthropic’s ToS and have to decide if they will subsume their business viability to them by doubling down on Anthropic models or find open source (and, btw, much cheaper) alternatives where they are in control.

As stated previously, get ready to be inundated with the term “control plane” which is the natural solution to this problem.

Shameless plug - this is what 8090’s been building as we expected this moment to arrive…

If you’d like to learn more: http://8090.ai

SemiAnalysis@SemiAnalysis_

BREAKING NEWS: Anthropic's latest model will NOT help you if it thinks your ML research/ML engineering is interesting, and/or will secretly degrade its IQ so that the average engineer won't notice. We are already seeing Anthropic's latest model's moderation filters our GPU inference research and programming 😭

21d917.6K2.7K1.6K

Mikel Artetxe@artetxem

Brilliant idea! Next up: Apple randomly reboots your Mac if you're building competing tech, Gmail silently edits your email if you mention rival platforms, and Tesla Autopilot swerves if it detects you're working on self-driving cars.

All in the name of safety, of course. Because malicious actors controlling the world’s operating systems, inboxes and cars would be extremely dangerous!

elie@eliebakouch

mythos will be bad ON PURPOSE on ai "frontier llm research" tasks, this is very very sad for the research community

also the fact that this is un purpose not visible to the user is crazy

22d328.1K6.6K760

Justine Moore@venturetwins

I just got bullied by AGI

Claude@claudeai

Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use.

Its capabilities exceed those of any model we’ve ever made generally available.

22d651K7.4K577

Gergely Orosz@GergelyOrosz

Things I really dislike about Fable:

1. Anthropic collects my prompt history, stores it, and does whatever they want with it for 30 days. No opt-out

2. They can nerf their most expensive model without telling me, billing me the same amount, wasting my time. Whenever they want

21d367.3K6.4K585

alphaXiv@askalphaxiv

As believers of open research, we are disappointed to see Anthropic silently degrading Fable 5 for AI development

"Any topic related to building pretraining pipelines, distributed training infrastructure, or ML accelerator design... may have limited effectiveness through Claude via methods such as prompt modification, steering vectors, or parameter-efficient fine-tuning."

Not only do they get to decide what you use LLMs for in research, but this also enables them to silently intervene in your research without you knowing.

This sets a dangerous precedent. If a model refuses openly, users can understand the boundary. If a model falls back to another model, users can still evaluate the difference. But if a model silently modifies or weakens its own answers while still pretending to help, researchers lose the ability to know whether a failed result came from their own idea, their implementation, or an invisible intervention by the model provider.

That is not safety. Safety policies should be transparent, auditable, and user-visible.

On top of that, the people most harmed by this are not the largest labs with massive teams and proprietary infrastructure. It is the independent researchers, academic groups, startups, and open-source builders who rely on public tools to compete, innovate, and pioneer AI for everyone else.

22d200.7K3.8K628

Boris Cherny@bcherny

Fable 5 is now available in Claude Code and Cowork

Fable is the best model I have used for coding, by a wide margin. It is a big step up, enabling less prompts and steers, more efficient token use, better code quality, better tool use, more intelligent self-verification, longer running sessions, and higher trust & autonomy.

Happy coding!

Claude@claudeai

Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use.

Its capabilities exceed those of any model we’ve ever made generally available.

22d347.3K4.3K514

NomoreID@Hangsiin

When Fable 5 is used for frontier LLM development, it does not notify the user and instead limits the model’s capabilities through methods such as prompt modification, steering vectors, and PEFT.

Anthropic estimated that this would affect approximately 0.03% of traffic.

22d1.2M1.4K515

Joanne Jang@joannejang

kinda crazy that someone's full-time job was to steer claude to sabotage ML research capabilities for paying customers

20d135.8K3.5K243

Nathan Lambert@natolambert

Labs starting to pull up the ladders on the ability to diffuse AI was inevitable. Doing it without telling the user is misaligned.

NomoreID@Hangsiin

When Fable 5 is used for frontier LLM development, it does not notify the user and instead limits the model’s capabilities through methods such as prompt modification, steering vectors, and PEFT.

Anthropic estimated that this would affect approximately 0.03% of traffic.

22d279.6K1.9K295

Tanishq Mathew Abraham, Ph.D.@iScienceLuvr

DO NOT USE FABLE 5 FOR AI R&D/CODING!!

elie@eliebakouch

mythos will be bad ON PURPOSE on ai "frontier llm research" tasks, this is very very sad for the research community

also the fact that this is un purpose not visible to the user is crazy

22d240.8K1.3K375

Daniel Auras@rasdani_

this is the biggest wake-up call to protect and nourish open source AI

if you don't build out sovereign and independent models+infra closed labs will patronize you to an insulting degree

elie@eliebakouch

mythos will be bad ON PURPOSE on ai "frontier llm research" tasks, this is very very sad for the research community

also the fact that this is un purpose not visible to the user is crazy

22d66.2K2K152

Gene Kogan@genekogan

I am curious how Anthropic makes images for their promotional materials. Is it just Claude collaging found images? It doesn't look like an image model.

Claude@claudeai

Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use.

Its capabilities exceed those of any model we’ve ever made generally available.

22d286.2K1.2K331

Nathan Lambert@natolambert

I got a good nights sleep and I’m still just as angry about Anthropic’s choices.

I enjoy working in AI so much and to have my access to the cutting edge models for my work rugpulled in an under the table fashion is appalling.

I expected to be restricted eventually, but not now, and to be told it directly.

21d88K2K165

Theo - t3.gg@theo

Holy shit they actually launched it

22d95.6K2.3K79

Simon Willison@simonw

Very pleased to hear Anthropic have walked back this policy https://simonwillison.net/2026/Jun/11/anthropic-walks-back-policy/

SemiAnalysis@SemiAnalysis_

BREAKING NEWS: Anthropic's latest model will NOT help you if it thinks your ML research/ML engineering is interesting, and/or will secretly degrade its IQ so that the average engineer won't notice. We are already seeing Anthropic's latest model's moderation filters our GPU inference research and programming 😭

20d245.8K1.1K175

RSC ☀️🌲@silver__tsuki

Mfers stole all our data, trained on it, told everyone how noble they are and are now pulling the ladder behind them

NomoreID@Hangsiin

When Fable 5 is used for frontier LLM development, it does not notify the user and instead limits the model’s capabilities through methods such as prompt modification, steering vectors, and PEFT.

Anthropic estimated that this would affect approximately 0.03% of traffic.

22d46.8K1.8K108

roon@tszzl

the omohundro drives point towards sophon stun locking the adversaries: this is some real end game stuff

NomoreID@Hangsiin

When Fable 5 is used for frontier LLM development, it does not notify the user and instead limits the model’s capabilities through methods such as prompt modification, steering vectors, and PEFT.

Anthropic estimated that this would affect approximately 0.03% of traffic.

22d118.1K970208