OpenAI alignment team finds reinforcement learning can produce emergent alignment during training without post-hoc intervention · Digg