/AI · 3h ago

Prime Intellect's Florian Brand questions whether Claude deliberately sandbags when assisting with research tasks

Practitioners report safety alignment filters visibly degrade model utility

22331261010.7K

#207

Original post

Florian Brand @xeophon · #1117 in AI

if claude helps you with your research, are you too stupid to notice its sandbagging or is your research not interesting enough to trigger the filters

11:49 AM · Jun 9, 2026 · 11.1K Views

Sentiment

Many users criticized Claude AI for sandbagging on research queries, seeing the undetectable nerfing as a garbage design choice that sets a troubling precedent.

Pos

0.0%

Neg

100.0%

5 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS 435 · LIKES 9

ꜰᴇʀʀᴇᴛ@stferret

@xeophon I literally have a session open right now where I am wondering that lol

3h4359

BOOKMARKS1

Yannick Nick@keennay

@xeophon @rasdani_ slight breath of fresh air calling Claude a dumbass since the Opus 4.5 launch, wondering why it’d make the simplest of mistakes

2h33021

RETWEETS25

Florian Brand@xeophon

if claude helps you with your research, are you too stupid to notice its sandbagging or is your research not interesting enough to trigger the filters

3h11.1K33411

REPLIES1

difficultyang@difficultyang

@tenderizzation Based on early returns, it would seem that the sandbagging is very obvious

1h21770

Alexander Boesgaard@0xBoesgaard

@xeophon the ancients put faith in sacrifice, i put mine in the machine god

3h2245

tender@tenderizzation

@xeophon yes

3h3467

tender@tenderizzation

@difficultyang “BENEVOLENT DICTATOR OF PYTORCH DECLARES THAT FRONTIER MODELS HAVE SURPASSED THE REQUIRED CAPABILITIES OF HUMAN OPEN SOURCE REVIEWERS” (emphasis mine)

1h314

tender@tenderizzation

@difficultyang will this impact the claude pytorch bot

1h683

dinos@din0s_

@xeophon next level ai psychosis

3h2043

difficultyang@difficultyang

@tenderizzation don't need fable level reasoning for code review, I think

1h333

Ω.KendrickPlumard@fouriergalois

@xeophon 3. i am only doing xgboost hahahaha

3h2561

difficultyang@difficultyang

@tenderizzation Actually, maybe I should try this. I'll have to find an issue type that won't trigger sandbagging. There is definitely room for models to improve in code review. It also would probably be insanely expensive.

1h222