9h agoVictoria Krakovna, Google DeepMind AI alignment researcher, introduces scheming honeypot evaluations to detect autonomous AI sabotageGemini models did not sabotage systems unless explicitly prompted.