@davidmanheim @eschatropic @JeffLadish What is the best single piece/article you have explaining the logic here, apart from the Christiano LW post or Carlsmith 2022 paper? I continue to be skeptical of the rationalist account of power seeking and instrumental convergence.
@eschatropic @JeffLadish @sebkrier 1. Because optimization pressure means the places where it can fail are the ones where it will fail. 2. Yes, it's a convergent instrumental goal. Bengio thinks he could avoid it; I'm skeptical, but he agrees that it won't happen by default. 3. Because the systems will get stronger over time.
