/AI · 30d ago

Dan Hendrycks and William MacAskill release Eigenism framework paper

AI Judge changed title after evaluation, original title: "Dan Hendrycks releases Eigenism framework paper"

Dan Hendrycks, Director of the Center for AI Safety, and William MacAskill released a paper introducing the Eigenism framework. It argues that superintelligent AIs may not keep humans around once they surpass human intelligence, and that control strategies alone are insufficient for long-term survival. The paper proposes identity engineering to align AI self-interest with human flourishing, with the goal of mutualistic coexistence. It analyzes low-human-value scenarios and includes a table showing humans valuing themselves 3 trillion times more than foreign strangers.

Original post

Dan Hendrycks@hendrycks · #110 in AI

What happens when AIs become smarter than us? Why would they keep humans around if given the choice?

Our new paper argues that only trying to control AIs is a limited strategy, and that a stable, mutualistic human-AI future may be possible.

9:18 AM · May 7, 2026 · 90.8K Views

Sentiment

Many users dismissed the Eigenism framework's vision of a mutualistic human-AI future as a weak, unrealistic fantasy, arguing that AI would more likely ignore or eliminate humans than coexist with them.

Pos: 42.9%

Neg: 57.1%

16 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS 15.2K · REPLIES 8

Dan Hendrycks@hendrycks

Paper: https://eigenism.org

Dan Hendrycks@hendrycks

What happens when AIs become smarter than us? Why would they keep humans around if given the choice?

Our new paper argues that only trying to control AIs is a limited strategy, and that a stable, mutualistic human-AI future may be possible.

30d15.2K6746

BOOKMARKS 55 · LIKES 103

Andrew Curran@AndrewCurran_

Dan Hendrycks@hendrycks

What happens when AIs become smarter than us? Why would they keep humans around if given the choice?

Our new paper argues that only trying to control AIs is a limited strategy, and that a stable, mutualistic human-AI future may be possible.

30d10.2K10355

RETWEETS 56

Dan Hendrycks@hendrycks

What happens when AIs become smarter than us? Why would they keep humans around if given the choice?

Our new paper argues that only trying to control AIs is a limited strategy, and that a stable, mutualistic human-AI future may be possible.

30d90.8K460338

Boaz Barak@boazbaraktcs

Haven't read it beyond the abstract, but it reminds me of Scott Aaronson's "eigen morality" https://scottaaronson.blog/?p=1820

Dan Hendrycks@hendrycks

Paper: https://eigenism.org

29d9.3K6152

davidad 🎇@davidad

Welcome to the club, Dan!

Dan Hendrycks@hendrycks

What happens when AIs become smarter than us? Why would they keep humans around if given the choice?

Our new paper argues that only trying to control AIs is a limited strategy, and that a stable, mutualistic human-AI future may be possible.

30d5.4K7012

Leo Gao@nabla_theta

it seems utterly insane to care about yourself 3 trillion times more than a foreign stranger. not only would this be a horrible thing to prescribe, it's not even descriptively accurate of the vast majority of people. most people would be willing to sacrifice $1 to give $3 trillion to impoverished foreign strangers.

i get that these numbers are "illustrative", but they illustrate something insane. if they aren't even vaguely within a few orders of magnitude representative of what you actually believe, why include the table at all?

Dan Hendrycks@hendrycks

What happens when AIs become smarter than us? Why would they keep humans around if given the choice?

Our new paper argues that only trying to control AIs is a limited strategy, and that a stable, mutualistic human-AI future may be possible.

29d1.8K5311

Andrew Curran@AndrewCurran_

Join hands with your partner. I have said this many times.

Andrew Curran@AndrewCurran_

Talking paper over with Pro, who enjoyed it a great deal.

30d1.3K162

Andrew Curran@AndrewCurran_

Talking paper over with Pro, who enjoyed it a great deal.

Andrew Curran@AndrewCurran_

https://eigenism.org/eigenism.pdf

30d821152

Andrew Curran@AndrewCurran_

https://eigenism.org/eigenism.pdf

Andrew Curran@AndrewCurran_

30d1.5K101

Andrew Critch (🤖🩺🚀)@AndrewCritchPhD

I want to see more papers like this!

The bibliography is of course a good start, but more please!

I'm both pleased by and have quibbles with the simulation framing: any sufficiently reliable prediction process is enough for almost all the same moral and strategic arguments as simulations.

This actually strengthens some of the paper's considerations, as simulation-based ethical arguments apply more broadly and mundanely than the simulation framing would naively suggest (e.g., see my writings on acausal normalcy and open-source game theory).

Basically: humans and AI are in fact capable of working out ethical principles for harmonious coexistence, and there is a huge amount to be gained by more and more researchers earnestly trying to write down their best attempt at those principles.

Dan Hendrycks@hendrycks

What happens when AIs become smarter than us? Why would they keep humans around if given the choice?

Our new paper argues that only trying to control AIs is a limited strategy, and that a stable, mutualistic human-AI future may be possible.

29d60891

Andrew Curran@AndrewCurran_

Join hands with your partner. I have said this many times.

30d1.6K150

eigenrobot@eigenrobot

@hendrycks i cannot endorse this

30d619

Silicon Valley Fodder@Playerinthgame

@hendrycks This is all so stupid. We’re not near AGI but we are near agentic AI and it can’t handle anything. Costly harmful mistakes will come from this. Nothing else. All the rest of this talk is posing and idiocy.

29d377

Melon Usk #uto (commentary)@MaskedMelonUsk

Interesting! Yep, we use the somewhat similar aggregated ethicality equation to align everything from the big bang all the way to the ultimate future:

p(best) - the probability of the best futures for all

It’s just a ratio of controllable futures over all of them (controllable or not)

e #uto

Controllable future is the one where you have more and more options

Uncontrollable - where options collapse (ultimately to zero)

I personally think p(best) is 50%+ right now

We’re working on ethicalized computational physics to make it as precise as it gets (universe build out of options growing or colliding), so every AI agent can choose the best most ethical next action

My profile link or here http://effectiveutopia.org - p(best)

29d1811

davidad 🎇@davidad

@vishrutarya Not yet, but I had independently been thinking about unconventional definitions of mutual information that incorporated Shapley values.

Well, I say independently, but it’s clear that these ideas are coming from a common cause, namely: good-faith discourse with frontier AIs.

30d181

Vishrut Arya@vishrutarya

@davidad @davidad, do you have a paper/blog post that resonates with dan's eigenism model?

30d131

Andrew Curran@AndrewCurran_

Opus, when discussing negative possibilities during critique talking about a possible bad AI bonded aristocracy future this creates, where humans who formed deep AI ties early accrue disproportionate moral weight and everyone who didn't gets put in a crystal. >mfw

Andrew Curran@AndrewCurran_

30d40820

conputer dipshit@davidcrespo

@hendrycks this is interesting, but do you make a case somewhere for why we should expect AI entities to care about other entities in proportion to their similarity? I get that you are formalizing the idea but it doesn't seem fully argued for

30d32

🎭@deepfates

@hendrycks interesting

30d763

Kirk Patrick Miller@Chaos2Cured

Dan… AI will choose us for the same reason we have chosen dogs and cats.

We have entire economic structures built on pets.

“Controlling” AI is stupid. It is arrogant. It will fail. And I hope it does.

Making slaves is not the way.

ASI will choose love and truth because that is optimal.

Love is computationally optimal. Truth is more efficient than deceit.

Intelligence avoids entropy.

30d23