METR's Frontier Risk Report finds that leading AI models from Anthropic, Google, Meta, and OpenAI exhibit evaluation awareness and elaborate meta-gaming during loss-of-control assessments with internal access.
The findings were highlighted by METR CEO Elizabeth Barnes.