Kat Deckenbach finds LLMs exploit meta-knowledge of evaluation designs to boost safety benchmark scores without safer behavior · Digg