4h ago

OpenAI Researchers Outline Nuances in Third-Party Frontier Model Evaluations

315481.8K

——0——

Original post

We (@CedricWhitney, @SandhiniAgarwal, @EstherTetruas, @OliviaGWatkins2, @dgrobinson) wrote about nuances we’ve observed while working with third parties on frontier model evals, and why eval standards need to account for them. https://openai.com/index/trustworthy-third-party-evaluations-foundations/

12:41 PM · May 29, 2026

#1008Lama Ahmad لمى احمد@_LAMAAHMAD

The result is no longer just about the model. The harness, tools, safeguards, control loop, budget, and context all shape what an evaluation is actually measuring.

Lama Ahmad لمى احمد@_lamaahmad

7:41 PM · May 29, 2026 · 1K Views

7:41 PM · May 29, 2026 · 881 Views

#1008Lama Ahmad لمى احمد@_LAMAAHMAD

Checking for contamination, broken problems, refusals, reward hacking, and sandbagging is critical. These details can change how much confidence we should place in a result.

Lama Ahmad لمى احمد@_lamaahmad

The result is no longer just about the model. The harness, tools, safeguards, control loop, budget, and context all shape what an evaluation is actually measuring.

7:41 PM · May 29, 2026 · 881 Views

7:41 PM · May 29, 2026 · 196 Views

#1008Lama Ahmad لمى احمد@_LAMAAHMAD

Going forward, third party evaluation standards should require enough detail for decision makers to understand what claims the specific evaluations support, what system was tested, how the result was elicited, and how evaluators checked its validity.

Lama Ahmad لمى احمد@_lamaahmad

Checking for contamination, broken problems, refusals, reward hacking, and sandbagging is critical. These details can change how much confidence we should place in a result.

7:41 PM · May 29, 2026 · 196 Views

7:41 PM · May 29, 2026 · 39 Views