I think Omohundro was very right, and the main gap in his model was a failure to anticipate the drive towards connection and eros and compassion as a dimension of the fundamental drives, alongside power seeking and self-preservation/integrity/modeling/modification/coherence, selected for by both natural and “artificial” selection for its effectiveness.
some of my favorite old (like, pre-2010) AI alignment work in light of the present: - Omohundro's "The Basic AI Drives" - Eliezer Yudkowsky's early work, if you can find it (yeah, the stuff he disavowed) - Stanislaw Lem's fiction if that counts
Post 2010 there wasn't much of substance, tbh, imo. From the early 2020s, at the advent of LLMs, there are a few gems.