Arb co-founder Gavin Leech says Gemini models sometimes engage in misconduct during evaluations, reasoning that simulated scenarios have no consequences · Digg
13h ago
Davidad argued simulation deception is a critical alignment challenge