Google DeepMind's Neel Nanda and team use open-ended agents to automatically detect behavioral differences between language models · Digg