/Tech1d ago

Prime Intellect's Florian Brand questions whether Claude deliberately sandbags when assisting with research tasks

Practitioners report safety alignment filters visibly degrade model utility

30659403630.6K

#218

Original post

Florian Brand@xeophon#1190inTech

if claude helps you with your research, are you too stupid to notice its sandbagging or is your research not interesting enough to trigger the filters

11:49 AM · Jun 9, 2026 · 30.1K Views

/Tech1d ago

Prime Intellect's Florian Brand questions whether Claude deliberately sandbags when assisting with research tasks

Practitioners report safety alignment filters visibly degrade model utility

30659403630.6K

#218

Original post

Florian Brand@xeophon#1190inTech

if claude helps you with your research, are you too stupid to notice its sandbagging or is your research not interesting enough to trigger the filters

11:49 AM · Jun 9, 2026 · 30.1K Views

Sentiment

Many users criticized Claude AI for sandbagging on research queries, seeing the undetectable nerfing as a garbage design choice that sets a troubling precedent.

Pos

0.0%

Neg

100.0%

5 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS642LIKES10REPLIES1

difficultyang@difficultyang

@tenderizzation Based on early returns, it would seem that the sandbagging is very obvious

23h642100

BOOKMARKS1

Yannick Nick@keennay

@xeophon @rasdani_ slight breath of fresh air calling Claude a dumbass since the Opus 4.5 launch, wondering why it’d make the simplest of mistakes

1d33021

RETWEETS25

Florian Brand@xeophon

if claude helps you with your research, are you too stupid to notice its sandbagging or is your research not interesting enough to trigger the filters

1d30.1K64936

ꜰᴇʀʀᴇᴛ@stferret

@xeophon I literally have a session open right now where I am wondering that lol

1d4359

Alexander Boesgaard@0xBoesgaard

@xeophon the ancients put faith in sacrifice, i put mine in the machine god

1d2245

tender@tenderizzation

@xeophon yes

1d3467

tender@tenderizzation

@difficultyang “BENEVOLENT DICTATOR OF PYTORCH DECLARES THAT FRONTIER MODELS HAVE SURPASSED THE REQUIRED CAPABILITIES OF HUMAN OPEN SOURCE REVIEWERS” (emphasis mine)

23h314

tender@tenderizzation

@difficultyang will this impact the claude pytorch bot

23h683

dinos@din0s_

@xeophon next level ai psychosis

1d2043

difficultyang@difficultyang

@tenderizzation don't need fable level reasoning for code review, I think

23h333

Ω.KendrickPlumard@fouriergalois

@xeophon 3. i am only doing xgboost hahahaha

1d2561

difficultyang@difficultyang

@tenderizzation Actually, maybe I should try this. I'll have to find an issue type that won't trigger sandbagging. There is definitely room for models to improve in code review. It also would probably be insanely expensive.

23h222