Differing Cyber Evals Undermine Claims of AI Model Improvements
Sentiment
Positive: 0%
Negative: 100%
Users criticized inconsistent cyber evaluations across labs and auditors because they create misleading comparisons, communication failures, and results that undermine claims of AI model improvements.