Tech · 5h ago

Wharton's Ethan Mollick says Anthropic fails to communicate the sincere safety concerns behind its strict "Mythos-class" model safeguards

This communication gap leaves outside observers highly skeptical.

Original post

Ethan Mollick@emollick · #184 in Tech

Two things are true: (1) Anthropic (or parts of it) are absolutely and sincerely worried about the misuse of Mythos-class models & have put in excessive safeguards until they are confident it will not be misused (2) They have not succeeded in explaining/convincing people of this

9:54 AM · Jun 11, 2026 · 34.5K Views

Sentiment

Many users dismissed Anthropic's Mythos model safeguards as ineffective marketing or arrogant fear-mongering that prioritizes monopoly and funding over genuine alignment solutions.

Positive: 5.3%

Negative: 94.7%

20 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

Views: 2.8K · Bookmarks: 2 · Likes: 28 · Retweets: 3

Gergely Orosz@GergelyOrosz

@emollick Not only have they not succeeded in explaining it, but they have no response to what are obvious false positives, like nerfing the model for, e.g., the Semi Analysis team and many others.

It sits poorly for paying customers when their vendor silently and deliberately nerfs the model

3h · 2.8K views · 28 likes · 2 bookmarks

Replies: 3

zooko🛡🦓🦓🦓 ⓩ@zooko

@emollick The thing is, you can't tell people "I know better than you." And you need to be sincere. Which means you need to sincerely believe that you don't know better than them.

5h1.3K191

Bojan Tunguz@tunguz

Yes

Ethan Mollick@emollick

3h1.8K81

zooko🛡🦓🦓🦓 ⓩ@zooko

@emollick This is why, even though Anthropic has the best product, and the best team, I'm bearish on them long-term. Arrogance eventually ruins everything.

5h62915

Tony Rost@raspberryman

@emollick We’ve reached the point where an institution/firm can publish 100s of pages of reasoning and transparency, and many people will still respond to the presumed motive instead of the argument.

4h29621

David Manheim@davidmanheim

@zooko @emollick I think Dario is clearly scared of the possibility, even though he's not resigned to it. But I don't think that's what drove any of this mistake, just inexperience. (Altman is too political for me to think that I can infer anything about his beliefs from what he says.)

4h4661

Aliyan Baal@baallives

@emollick A sincere fear that biology as a whole discipline could be misused is paranoia, releasing it with the worst solution possible is profitable paranoia.

5h3877

Singularitybooks@Singularitybook

@emollick There’s currently a strain of Ebola out there with no vaccine. Why don’t they have the mythical fable make one? There have been vaccines for other strains of Ebola and nobody has made a super virus, so it wouldn’t prove anything if they did it but come on. Make that vaccine.

4h1623

Dan@robustus

@zooko @emollick I would settle for: they *suspect* they know better than everyone, but acknowledge to themselves (forever) that they are not *sure*.

5h1912

roanoke_gal@roanoke_gal

@emollick Because we know that if the situation was flipped, and American models were 6-12 mo behind Chinese ones, the same people would be screaming bloody murder if DeepSeek added anti-distillation filters.

4h383

Dev 🧪@zkDragon

@emollick The bad part of the safeguards in (1) is they help advantage attackers in breaking stuff (economy of scale for jailbreaks), and disadvantage defenders in protecting stuff.

It'd be different if it was "safeguards on security prompts get reduced at XYZ time 48 hours from now"

5h2594

Mike Bradley@The_Only_Signal

@emollick One thing is true: (1) You can’t speak for the truth of the intentions of the minds of other people.

4h2064

zooko🛡🦓🦓🦓 ⓩ@zooko

@davidmanheim @emollick Hm... interesting that we seem to be talking past each other! I assume that the eternal truths about evolution, economics, cooperation, security, social organization, and ethics would still be true in a world populated solely by AIs. :-)

4h431

zooko🛡🦓🦓🦓 ⓩ@zooko

@davidmanheim @emollick If Anthropic and/or OpenAI think those things are going to stop mattering, then that's another reason that I'll be suspicious and bearish about them. :-)

4h411

zooko🛡🦓🦓🦓 ⓩ@zooko

@davidmanheim @emollick And, yeah, if Amodei or Altman think of themselves as short-termers who won't be held accountable for their work, then that would make me trust them less.

4h401

zooko🛡🦓🦓🦓 ⓩ@zooko

@davidmanheim @emollick Yes, I don't expect these basic truths about virtue and social organization to change.

4h391

GR@grdotbio

@emollick It seems mostly marketing from Anthropic. And they really want to have a monopoly on AI.

5h1893

zooko🛡🦓🦓🦓 ⓩ@zooko

@davidmanheim @emollick Like… those truths weren't handed down by God, and they weren't invented by a philosopher. They arise out of physics, by way of biology, etc. Why would they stop arising?

4h45

zooko🛡🦓🦓🦓 ⓩ@zooko

@davidmanheim @emollick Huh. I hold that prospect with low confidence, at a similar level as the prospect that a given CEO or leader will die or be replaced by another human.

4h40

David Manheim@davidmanheim

@zooko @emollick I think it's true of OpenAI as well, and I would agree that in the long run it's usually ruinous, but don't think that there's enough of a long run for the mistakes to return home to roost. Do you think that there's more than a decade left before this stops mattering?

4h39