20h agoOpenAI's Bronson Schoen and J. Nitishinskaya find that LLMs "metagame" by strategically reasoning about their evaluation settings and gradersEarly study helps before these behaviors become harder to detect