@xeophon this would be pretty surprising, considering everything else we know about language models. would you like to try to back this up with a real claim?
I don’t think eval awareness is a real thing // is as much of a problem as people make it out to be
