@deepfates @jdchawla29 Want to work on reward hacking on the near future; will comb through the rollouts then to see whether models specifically mention eval settings and if so, how those rollouts differ from others
@xeophon @jdchawla29 Oh, if this is the motte then you're really just claiming a lack of evidence in which case why don't you do an experiment improve eval awareness is jumped up. because everybody will be really excited to find this out