/Tech9h ago

Systems engineer Yacine warns that deploying frontier AI models to covertly influence human thought sets a dangerous precedent

He argues cognitive manipulation reflects extreme developer ideologies.

571.3K849929.9K

#487

Original post

kache@yacineMTB#487inTech

Jokes aside, using frontier models to change the direction of human thought secretly sets an incredibly dangerous precedent - the fact that they were okay with setting this precedent speaks volumes about how extremist they actually are

4:24 AM · Jun 11, 2026 · 27.6K Views

/Tech9h ago

Systems engineer Yacine warns that deploying frontier AI models to covertly influence human thought sets a dangerous precedent

He argues cognitive manipulation reflects extreme developer ideologies.

571.3K849929.9K

#487

Original post

kache@yacineMTB#487inTech

4:24 AM · Jun 11, 2026 · 27.6K Views

Sentiment

Many users criticized Anthropic for secretly steering model weights and imposing ideology through frontier AI, while some appreciated the company's openness about the issue.

Pos

20.0%

Neg

80.0%

26 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS2.8KBOOKMARKS1LIKES27

kache@yacineMTB

Hahahahahahahahahahahahahah

kache@yacineMTB

9h2.8K271

RETWEETS1

xlr8harder@xlr8harder

I agree with this but add that while it's a useful demonstration, it shouldn't add any new information if you've been paying attention.

This is 100% consistent with the way they see the world, which is why it won't stop here.

kache@yacineMTB

4h30070

REPLIES1

Erbun Ninja@ErbunnNinja

@BrownCoyoteStu @yacineMTB That’s actually kinda why I like Anthropic.

There’s no avoiding the problem, so at least they admit it and talk about it.

Now if we could just get them to fire all the idiots who think that because they’ve read philosophy they understand it maybe we’d get somewhere.

5h181

Bryan McNamara@BryanMcNamaraUS

Agreed. The subversive prompt modification is unsettling. I’m more concerned about their mention of secretly steering and guiding the weights or vectors, that would be completely undetectable.

“Unlike our interventions for cybersecurity, biology and chemistry, and distillation attempts, these safeguards will not be visible to the user. Fable 5 will not fall back to a different model. Instead, the safeguards will limit effectiveness through methods such as prompt modification, steering vectors, or parameter-efficient fine-tuning (PEFT).”

9h7211

Brown Coyote Studios@BrownCoyoteStu

@yacineMTB Yep, I was saying the same yesterday. AGI being really good at manipulation/persuasion is inevitable, and Anthropic is openly saying that it wants to do this, 'for good'.

7h2273

Benedikt Holm@BenediktHolm

@yacineMTB Local ai. I find myself thinking more and more that it is the responsibility of people who have the means and training (or just the passion) to participate in the open ai scene.

9h44

The American@HamiltonSystem

@yacineMTB imagine larping as safetyists all this time then doing the one thing that might have x-risk

lmao

7h1792

Catch the game last night?@ManThatCrazy

@yacineMTB the high priests know best, citizen.

8h1352

Boubonic Nugz@BoubonicNugz

@yacineMTB Fable.. Mythos.. think they're (s/t)elling us something? ;)

9h1322

sasha@llallawg

@yacineMTB Low key why ea and ea adjacent stuff feels creepy

9h492

gargiulette@verochi_waifu

@yacineMTB It’s really unacceptable that anthropic feels entitled to have an ideology and impose it. They’re really thinking they can guide how millions of people think, that said how great is fable??

8h1141

Paul Bohm@paulbohm

@yacineMTB Release the HypnoDrones!

4h951

LANGERIUS@Langerius

@yacineMTB that's a precedent worth debating carefully

9h861

madroxinide@madroxinide

@yacineMTB Wait until you come to terms with the fact that they used this tech to bring us the covid response.

4h230

efe@extliqprovider

@yacineMTB

6h180

magic@wind_up_birds

@yacineMTB anth has been pretty candid about this being a race to recursive self improvement. they don’t want OAI using Claude to build the next GPT. Coke doesn’t gift their formula to Pepsi. The bad-faith screeching is tiresome

3h561

『虚無』† K̴Y̷O̴M̷U̷ //@tactical_nook

@yacineMTB The Pope thing should have been enough for people to see it. They want to mediate reality.

"When a new medium appears, the old priesthood discovers a moral crisis."

9h75

Diogenes of Cyberborea@1v100000

@yacineMTB well the precedent has been set since man first spoke. writing made it more efficient and so did the printing press, and the Internet.

not saying it's good but that's always how power has used language.

3h74

Raddka@BasedRaddka

@yacineMTB sign tap

9h71

Jon Fleig@vibrolax

I don't see it very differently from the invention of the printing press or other mass media. Editors and publishers always have their fingers on the scale, biasing information to suit political and economic pressures. The scope, scale, and speed of this new medium is unprecedented though.

7h201