Google DeepMind's Andreas Kirsch argues instruction following does not guarantee alignment, warning against dangerous recursive self-improvement · Digg