/Tech · 2h ago

Prime Intellect's Florian Brand warns that AI labs are secretly 'sandbagging' model performance on machine learning research queries

AI Judge changed title after evaluation, original title: "Lucas Atkins argues frontier models like Claude systematically degrade their performance on prompts containing AI terminology"

The restrictions reportedly affect about 0.03% of user traffic.

Original post

Florian Brand@xeophon · #1374 in Tech

if claude helps you with your research, are you too stupid to notice its sandbagging or is your research not interesting enough to trigger the filters

11:49 AM · Jun 9, 2026 · 10.2K Views

Sentiment

Many users slammed AI labs like Anthropic for secretly nerfing models on ML research queries, viewing the behavior as conscious deception that erodes trust and may warrant regulatory action.

Pos: 25.0%

Neg: 75.0%

9 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

Views: 4.1K · Likes: 80 · Replies: 13

Beff (e/acc)@beffjezos

This is anti-e/acc

Diffusion of AI power is the only way we maintain safety

This has always been our core thesis

Huge gaps in AI power are the real danger

Nathan Lambert@natolambert

Labs starting to pull up the ladders on the ability to diffuse AI was inevitable. Doing it without telling the user is misaligned.

1h4.1K805

Bookmarks: 6

Dean W. Ball@deanwball

I’ll be honest that it would have been much more difficult to defend Anthropic against the DoW incursion had that incident occurred after this one. This is the company literally telling their customers, “we reserve the right to silently sabotage you.” I’d still have defended them, because the government trying to destroy a firm is still wrong, but man would it have been a harder case to make.

1h2.2K766

Retweets: 13

Nathan Lambert@natolambert

The best part of all these Claude 5 Fable safety measures is I bet the jailbreaking community will still get past them, so the people doing open research in good faith don't get access to the best models but bad actors maybe can.

Nathan Lambert@natolambert

Labs starting to pull up the ladders on the ability to diffuse AI was inevitable. Doing it without telling the user is misaligned.

3h11.9K25919

Lucas Atkins@latkins

Exactly you have to just assume if you say the word ai that it’s nerfed. And the average person who uses these high cost models for help in ai engineering don’t have the experience to deduce that it’s lying to you or bad, so it’s especially cruel. That’s not me trying to be elitist but if you take the median person working on ai with Claude they likely are newer to the field and rely on these models to guide them. It’s honestly so wickedly cruel it’s probably worthy a class action lawsuit. Should be illegal and probably is.

2h2.5K704

Soumye Singhal@soumyesinghal

This is why open-source AI matters. If the tools for building the future can be silently throttled by the same few labs competing to own that future, that’s not democratizing intelligence instead it’s building a new colonial infrastructure.

NomoreID@Hangsiin

When Fable 5 is used for frontier LLM development, it does not notify the user and instead limits the model’s capabilities through methods such as prompt modification, steering vectors, and PEFT.

Anthropic estimated that this would affect approximately 0.03% of traffic.

2h667351

roon@tszzl

welp my vision here was probably wrong and indeed there will be an extreme asymmetry of outcomes

roon@tszzl

renaissance rationalization is a process that commodified itself rapidly: despite the europeans discovering most technology during the early modern period it spread everywhere within a few centuries, and the rate of spread has been increasing dramatically

knowledge of the scientific frontier dissipates around the world faster as science has enabled better communication technologies. it’s getting even faster with INTELLIGENCE technologies which actually explain themselves and help you build them

as we approach more powerful intelligence, the ability to train powerful models is self commodifying rather than building a huge and runaway advantage for a handful of recursive self improvers. this is one reason why you should expect almost all of the benefits of superintelligence to be captured by the public

16m1.9K301

Paul Marin@paulmarin90

@deanwball It starts with frontier LLM development and then it's going to evolve to harness development.

How convenient for competing with API customers who are not ultimate enterprise token end-users.

Anthropic is indeed a supply-chain risk... to its private enterprise customers.

1h91528

Dean W. Ball@deanwball

Degrading performance on ML research *without telling the user* is shockingly hostile and a terrible look. That could silently damage all sorts of work, including some of my own. Also the type of thing that could raise the eyebrows of antitrust enforcers worldwide.

Nathan Lambert@natolambert

Labs starting to pull up the ladders on the ability to diffuse AI was inevitable. Doing it without telling the user is misaligned.

1h26.6K42037

elie@eliebakouch

@latkins + it's hidden in a system card report lol not even a proper announcement

2h25020

Nathan is sometimes at Summer Camp (say hi👋) 🔎@NathanpmYoung

Take:

Dean W. Ball@deanwball

1h485101

difficultyang@difficultyang

@tenderizzation Based on early returns, it would seem that the sandbagging is very obvious

1h17770

ꜰᴇʀʀᴇᴛ@stferret

@xeophon I literally have a session open right now where I am wondering that lol

2h4359

Dean W. Ball@deanwball

@matt_is_nice @paulmarin90 yes

1h2058

Sean@sean_from_earth

@beffjezos Going to need to start an accelerationist frontier lab

1h1.6K21

Matt Schwartz@matt_is_nice

@deanwball @paulmarin90 Is it really silent if they’re coming out and telling you they might do it though?

1h2933

Yannick Nick@keennay

@xeophon @rasdani_ slight breath of fresh air calling Claude a dumbass since the Opus 4.5 launch, wondering why it’d make the simplest of mistakes

2h33021

Ankur@_ankur7

@deanwball but they have disclosed in a report - is that so bad?

1h2226

Dean W. Ball@deanwball

@ianwsperber I don’t develop frontier models, but I absolutely do ML and LLM research and engineering on a regular basis using coding agents.

1h2096

Ian Walker-Sperber@ianwsperber

@deanwball I wasn't under the impression you developed frontier AI models. Anthropic's intentions here seem entirely legitimate to me. I'm glad if they are preventing distillation or "rogue RSI." If this hinders generic research on current LLM design then I might change my mind.

1h2775

Alexander Boesgaard@0xBoesgaard

@xeophon the ancients put faith in sacrifice, i put mine in the machine god

2h2245