/Tech7h ago

Anthropic's shift to explicit safety refusals on technical tasks draws pushback from Luma AI's Amit Jain

The expanded filters temporarily block benign programming prompts.

18309171810.9K

#1002

Original post

Hensen Juang@basedjensen

Aight anthropic bros this is the stop I get off mr amodei wild ride ... Jfc what are you all doing

10:14 PM · Jun 10, 2026 · 1.3K Views

/Tech7h ago

Anthropic's shift to explicit safety refusals on technical tasks draws pushback from Luma AI's Amit Jain

The expanded filters temporarily block benign programming prompts.

18309171810.9K

#1002

Original post

Hensen Juang@basedjensen

Aight anthropic bros this is the stop I get off mr amodei wild ride ... Jfc what are you all doing

10:14 PM · Jun 10, 2026 · 1.3K Views

Sentiment

Many users criticize Anthropic's shift to open refusals and broadened safeguards on tasks like ML research and distributed training as cynical PR that blocks open models and erodes trust.

Pos

14.3%

Neg

85.7%

9 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS5KBOOKMARKS4LIKES51RETWEETS3REPLIES1

Gergely Orosz@GergelyOrosz

@emollick Not only have they not succeeded explaining it, but they have no response to what are obvious false positives, like nerfing the model for eg the Semi Analysis team and many others.

It sits poorly for paying customers when their vendor silently and deliberately nerfs the model

SemiAnalysis@SemiAnalysis_

BREAKING NEWS: Anthropic's latest model will NOT help you if it thinks your ML research/ML engineering is interesting, and/or will secretly degrade its IQ so that the average engineer won't notice. We are already seeing Anthropic's latest model's moderation filters our GPU inference research and programming 😭

2h5K514

Perry E. Metzger@perrymetzger

Why do they want to prevent people from doing things like distributed training in the first place? What is the rational explanation for that, other than attempting to prevent people from building open source models? And the whole claim about the false positive surface doesn’t ring true in the first place, it feels like an excuse.

8h1469

The Singularity Project@01Singularity01

@perrymetzger "I'm sorry, Dave. You can't do that." -- HALopic, 2026 A Safe Odyssey

4h1111

Alex Grenier@alexgrenier

The claim is true. The classifier on the release just identifies a few keywords like biology and cybersecurity patterns and immediately routes to Opus 4.8 (which is still impressively capable in both of these topics). Routing entire topics to 4.8 is not viable, so I am sure they will tune the classifier so the positives fit a much narrower and true threat model.

As for the training / LM frontier research, it's similar, but the category is vastly different.

There's a dynamic there where, upon the release of every new frontier model, it immediately gets assaulted by "LM researchers" including nefarious ones e.g. China-based. Not just for distillation but other forms of extract-and-launder schemes.

On one hand, I understand they don't want to give their Frontier capabilities to Frontier competitors and foreign adversaries.

And on the other hand, a lot of so called ML researchers are totally fake and just extract new models as they come out and race to be the first to take credit for and publish AI-generated white papers with their name on it.

I believe essentially these were the 2 surfaces they sought to reduce.

I'm close to $1000 in Fable tokens in the past 24 hours, on two projects, one is biology and the other is Frontier AI, and I have had absolutely zero issues with the classifier.

So I think it has profiled accounts and it doesn't just classify based on the prompt, but in general the user's historical behavior.

Like someone might be asking a being biology question today, but if previously you were, put of curiosity, asking Opud to synthesize meth or RNA sequence to make ebola air born, you are likely barred from these topics on Fable.

This is my educated guess.

8h73

Perry E. Metzger@perrymetzger

@n00buntu I wrote that myself. I have no problem with using AI to write stuff, mind you, and don’t think it’s shameful to use AI.

7h463

Ab Homine Deus@AbHomineDeus

@perrymetzger How can anyone trust they're not still secretly sandbagging? Once the mask is off...

8h566

Perry E. Metzger@perrymetzger

@alexgrenier Apologetics, especially thin ones, are not going to change anyone’s mind. People like @DeryaTR_ can’t even ask an innocent thing like “what is a mitochondrion”, and claims that they can’t build a better classifier are horseshit, these things are amazing classifiers.

8h356

Jeff@InquisitiveUrsa

@perrymetzger It is quite obviously significantly better, because their previous actions were really bad.

8h701

Alex Grenier@alexgrenier

@perrymetzger What they wrote was that the tradeoff to release Fable rapidly would be to have a large false-positive surface and that they're working to improve the classifier and reduce that as fast as possible.

8h189

n00buntu@n00buntu

@perrymetzger > That problem is not the specifics of this AI system, it is Anthropic and its ideological goals.

😆ifiuc you asked Claude to write a post criticizing Anthropic, just as the anti-data-centre activists used ChatGPT to write their diatribes?

Hilariously good fun.

7h82

Perry E. Metzger@perrymetzger

@InquisitiveUrsa You can make something better and still not make it good.

8h574

Black Lodge Games@blacklodgegames

@perrymetzger It's also impossible to know if they are even telling the truth. They could simply continue degrading performance silently at any time.

5h833

Lambda Rick 🏴‍☠️/acc@benrayfield

@perrymetzger Anthropic probably wants to wait until the heat dies down before doing the exact same thing again.

6h243

John Ennis@johnennis

@perrymetzger Correct

7h1551

Lambda Rick 🏴‍☠️/acc@benrayfield

@perrymetzger GPT has never refused any of my questions about AI algorithms in general nor to write code attempting to do them.

6h201

GooGZ AI@PaulGugAI

@perrymetzger Thanks for speaking up about this. Hopefully those with more influence on this platform continue to do so and keep some pressure on.

9h22

Don't think, it might be uncomfortable@presence2222

@perrymetzger Then we'll simply refuse to pay Anthropic. There are dozens of very good AI companies now, and not all of them are as pretentious and know-it-all as Anthropic.

6h17

Unit Accord@unit_accord

@perrymetzger They're not ideological goals. That's just their PR strategy.

Analyze them like cynical economic agents.

6h10

n00buntu@n00buntu

@perrymetzger > I wrote that myself.

Ah. I thought perhaps you were poking fun at Anthropic.

And yes, everything you've written makes sense and illuminates the situation for me. Thank you. 🙏

6h6

JK@_junaidkhalid1

Switching from silent sabotage to open refusal is a transparency improvement, sure. But it doesn't address who gets to define the boundaries, or why those definitions keep expanding to catch perfectly standard technical work. The policy change and the underlying ideology are two separate things, and conflating them does a lot of work for Anthropic here.

4h1