@davidmanheim @eschatropic @JeffLadish What is the best single piece/article you have explaining the logic here, apart from the Christiano LW post or Carlsmith 2022 paper? I continue to be skeptical of the rationalist account of power seeking and instrumental convergence.
DeepMind's Séb Krier questions instrumental convergence and asks for literature explaining the logic of AI power-seeking
David Manheim asked Krier to specify his exact objections.
Most Activity
@davidmanheim @eschatropic @JeffLadish A bunch of different things but don't think I'll have the time to elaborate on all aspects here, I'm mostly interested in collecting/re-exploring the key pieces about the worldview.
@sebkrier @eschatropic @JeffLadish I'm not sure, but I don't really understand what's being questioned about the thesis; are you arguing that systems might not have goals, or that they might not want to achieve them, or that generally valuable resources and power might not help with doing so?
@sebkrier @davidmanheim @eschatropic I have some interest in writing this up
@davidmanheim @eschatropic @JeffLadish What is the best single piece/article you have explaining the logic here, apart from the Christiano LW post or Carlsmith 2022 paper? I continue to be skeptical of the rationalist account of power seeking and instrumental convergence.