Cool new work from our institute on evaluation awareness!
If an LLM knows what an evaluation looks like, can it use that knowledge to improve its performance on safety benchmarks? Our new preprint answers this. Excited to share: Models That Know How Evaluations Are Designed Score Safer 🧵 1/8 https://compass-group-tue.github.io/arxiv2026_evaluation_meta_knowledge/
