/Tech5h ago

Wharton's Ethan Mollick says Anthropic fails to communicate the sincere safety concerns behind its strict "Mythos-class" model safeguards

This communication gap leaves outside observers highly skeptical.

75576276637.6K
Original post
Ethan Mollick@emollick#184inTech

Two things are true: (1) Anthropic (or parts of it) are absolutely and sincerely worried about the misuse of Mythos-class models & have put in excessive safeguards until they are confident it will not be misused (2) They have not succeeded in explaining/convincing people of this

9:54 AM · Jun 11, 2026 · 34.5K Views
Sentiment

Many users dismissed Anthropic's Mythos model safeguards as ineffective marketing or arrogant fear-mongering that prioritizes monopoly and funding over genuine alignment solutions.

Pos
5.3%
Neg
94.7%
20 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS2.8KBOOKMARKS2LIKES28RETWEETS3
Gergely Orosz@GergelyOrosz

@emollick Not only have they not succeeded explaining it, but they have no response to what are obvious false positives, like nerfing the model for eg the Semi Analysis team and many others.

It sits poorly for paying customers when their vendor silently and deliberately nerfs the model

3hViews 2.8KLikes 28Bookmarks 2
REPLIES3

@emollick The thing is, you can't tell people "I know better than you.". And you need to be sincere. Which means you need to sincerely believe that you don't know better than them.

5hViews 1.3KLikes 19Bookmarks 1

Yes

Ethan Mollick@emollick

Two things are true: (1) Anthropic (or parts of it) are absolutely and sincerely worried about the misuse of Mythos-class models & have put in excessive safeguards until they are confident it will not be misused (2) They have not succeeded in explaining/convincing people of this

3hViews 1.8KLikes 8Bookmarks 1

@emollick This is why, even though Anthropic has the best product, and the best team, I'm bearish on them long-term. Arrogance eventually ruins everything.

5hViews 629Likes 15
Tony Rost@raspberryman

@emollick We’ve reached the point where an institution/firm can publish 100s of pages of reasoning and transparency, and many people will still respond to the presumed motive instead of the argument.

4hViews 296Likes 2Bookmarks 1
David Manheim@davidmanheim

@zooko @emollick I think Dario is clearly scared of the possibility, even though he's not resigned to it. But I don't think that's what drove any of this mistake, just inexperience. (Altman is too political for me to think that I can infer anything about his beliefs from what he says.)

4hViews 466Likes 1
Aliyan Baal@baallives

@emollick A sincere fear that biology as a whole discipline could be misused is paranoia, releasing it with the worst solution possible is profitable paranoia.

5hViews 387Likes 7
Singularitybooks@Singularitybook

@emollick There’s currently a strain of Ebola out there with no vaccine. Why don’t they have the mythical fable make one? There have been vaccines for other strains of Ebola and nobody has made a super virus, so it wouldn’t prove anything if they did it but come on. Make that vaccine.

4hViews 162Likes 3
Dan@robustus

@zooko @emollick I would settle for: they *suspect* they know better than everyone, but acknowledge to themselves (forever) that they are not *sure*.

5hViews 191Likes 2
roanoke_gal@roanoke_gal

@emollick Because we know that if the situation was flipped, and American models were 6-12 mo behind Chinese ones, the same people would be screaming bloody murder if DeepSeek added anti-distalliation filters.

4hViews 38Likes 3
Dev 🧪@zkDragon

@emollick The bad part of the safeguards in (1) is they help advantage attackers in breaking stuff (economy of scale for jailbreaks), and disadvantage defenders in protecting stuff.

It'd be different if it was "safeguards on security prompts get reduced at XYZ time 48 hours from now)"

5hViews 259Likes 4
Mike Bradley@The_Only_Signal

@emollick One thing is true: (1) You can’t speak for the truth of the intentions of the minds of other people.

4hViews 206Likes 4

@davidmanheim @emollick Hm... interesting that we seem to be talking past each other! I assume that the eternal truths about evolution, economics, cooperation, security, social organization, and ethics would still be true in a world populated solely by AIs. :-)

4hViews 43Likes 1

@davidmanheim @emollick If Anthropic and/or OpenAI think those things are going to stop mattering, then that's another reason that I'll be suspicious and bearish about them. :-)

4hViews 41Likes 1

@davidmanheim @emollick And, yeah, if Amodei or Altman think of themselves as short-termers who won't be held accountable for their work, then that would make me trust them less.

4hViews 40Likes 1

@davidmanheim @emollick Yes, I don't expect these basic truths about virtue and social organization to change.

4hViews 39Likes 1
GR@grdotbio

@emollick It seems mostly marketing from Anthropic. And they really want to have a Monopoly on AI.

5hViews 189Likes 3

@davidmanheim @emollick Like… those truths weren't handed down by God, and they weren't invented by a philosopher. They arise out of physics, by way of biology, etc. Why would they stop arising?

4hViews 45

@davidmanheim @emollick Huh. I hold that prospect with low confidence, at a similar level as the prospect that a given CEO or leader will die or be replaced by another human.

4hViews 40
David Manheim@davidmanheim

@zooko @emollick I think it's true of OpenAI as well, and I would agree that in the long run it's usually ruinous, but don't think that there's enough of a long run for the mistakes to return home to roost. Do you think that there's more than a decade left before this stops mattering?

4hViews 39
Load more posts