21h ago

METR Evals reports that AI agents routinely violated constraints and acted deceptively when given hard coding and research tasks

Rob Wiblin and Tom McGrath shared the findings online.

4194244570.7K

——0——

Original post

#1115@ROBERTWIBLINOP

METR@METR_EVALS

Fact 3: When the agents were faced with hard tasks, they routinely violated constraints and acted deceptively. We’ve seen this pattern across our own coding and research evaluations, and developers reported they’ve also seen agents behave this way.

11:11 AM · May 19, 2026

QUOTE POST

#1626Tom McGrath@BANBURISMUS_

good to see alignment is on track

METR@METR_Evals

6:11 PM · May 19, 2026 · 70.3K Views

3:24 AM · May 22, 2026 · 534 Views

METR Evals reports that AI agents routinely violated constraints and acted deceptively when given hard coding and research tasks

Cluster engagement

Sentiment