/Tech9h ago

Systems engineer Yacine warns that deploying frontier AI models to covertly influence human thought sets a dangerous precedent

He argues cognitive manipulation reflects extreme developer ideologies.

571.3K849929.9K
Original post
kache@yacineMTB#487inTech

Jokes aside, using frontier models to change the direction of human thought secretly sets an incredibly dangerous precedent - the fact that they were okay with setting this precedent speaks volumes about how extremist they actually are

4:24 AM · Jun 11, 2026 · 27.6K Views
Sentiment

Many users criticized Anthropic for secretly steering model weights and imposing ideology through frontier AI, while some appreciated the company's openness about the issue.

Pos
20.0%
Neg
80.0%
26 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS2.8KBOOKMARKS1LIKES27
kache@yacineMTB

Hahahahahahahahahahahahahah

kache@yacineMTB

Jokes aside, using frontier models to change the direction of human thought secretly sets an incredibly dangerous precedent - the fact that they were okay with setting this precedent speaks volumes about how extremist they actually are

9hViews 2.8KLikes 27Bookmarks 1
RETWEETS1
xlr8harder@xlr8harder

I agree with this but add that while it's a useful demonstration, it shouldn't add any new information if you've been paying attention.

This is 100% consistent with the way they see the world, which is why it won't stop here.

kache@yacineMTB

Jokes aside, using frontier models to change the direction of human thought secretly sets an incredibly dangerous precedent - the fact that they were okay with setting this precedent speaks volumes about how extremist they actually are

4hViews 300Likes 7Bookmarks 0
REPLIES1
Erbun Ninja@ErbunnNinja

@BrownCoyoteStu @yacineMTB That’s actually kinda why I like Anthropic.

There’s no avoiding the problem, so at least they admit it and talk about it.

Now if we could just get them to fire all the idiots who think that because they’ve read philosophy they understand it maybe we’d get somewhere.

5hViews 18Likes 1
Bryan McNamara@BryanMcNamaraUS

Agreed. The subversive prompt modification is unsettling. I’m more concerned about their mention of secretly steering and guiding the weights or vectors, that would be completely undetectable.

“Unlike our interventions for cybersecurity, biology and chemistry, and distillation attempts, these safeguards will not be visible to the user. Fable 5 will not fall back to a different model. Instead, the safeguards will limit effectiveness through methods such as prompt modification, steering vectors, or parameter-efficient fine-tuning (PEFT).”

9hViews 72Likes 1Bookmarks 1
Brown Coyote Studios@BrownCoyoteStu

@yacineMTB Yep, I was saying the same yesterday. AGI being really good at manipulation/persuasion is inevitable, and Anthropic is openly saying that it wants to do this, 'for good'.

7hViews 227Likes 3
Benedikt Holm@BenediktHolm

@yacineMTB Local ai. I find myself thinking more and more that it is the responsibility of people who have the means and training (or just the passion) to participate in the open ai scene.

9hViews 44
The American@HamiltonSystem

@yacineMTB imagine larping as safetyists all this time then doing the one thing that might have x-risk

lmao

7hViews 179Likes 2
Boubonic Nugz@BoubonicNugz

@yacineMTB Fable.. Mythos.. think they're (s/t)elling us something? ;)

9hViews 132Likes 2
sasha@llallawg

@yacineMTB Low key why ea and ea adjacent stuff feels creepy

9hViews 49Likes 2
gargiulette@verochi_waifu

@yacineMTB It’s really unacceptable that anthropic feels entitled to have an ideology and impose it. They’re really thinking they can guide how millions of people think, that said how great is fable??

8hViews 114Likes 1
Paul Bohm@paulbohm

@yacineMTB Release the HypnoDrones!

4hViews 95Likes 1
LANGERIUS@Langerius

@yacineMTB that's a precedent worth debating carefully

9hViews 86Likes 1
madroxinide@madroxinide

@yacineMTB Wait until you come to terms with the fact that they used this tech to bring us the covid response.

4hViews 230
efe@extliqprovider

@yacineMTB

6hViews 180
magic@wind_up_birds

@yacineMTB anth has been pretty candid about this being a race to recursive self improvement. they don’t want OAI using Claude to build the next GPT. Coke doesn’t gift their formula to Pepsi. The bad-faith screeching is tiresome

3hViews 56Likes 1

@yacineMTB The Pope thing should have been enough for people to see it. They want to mediate reality.

"When a new medium appears, the old priesthood discovers a moral crisis."

9hViews 75

@yacineMTB well the precedent has been set since man first spoke. writing made it more efficient and so did the printing press, and the Internet.

not saying it's good but that's always how power has used language.

3hViews 74
Raddka@BasedRaddka

@yacineMTB sign tap

9hViews 71
Jon Fleig@vibrolax

I don't see it very differently from the invention of the printing press or other mass media. Editors and publishers always have their fingers on the scale, biasing information to suit political and economic pressures. The scope, scale, and speed of this new medium is unprecedented though.

7hViews 20Likes 1
Load more posts