1d ago

METR Researcher Clarifies Companies Lack Editorial Control Over Reports

0
Original post

@xeophon @METR_Evals I'd say companies controlled what non-public evidence we could cite, but didn't have editorial control. They signed off on all that *before* we wrote the report. After that, we were free to write & publish whatever we thought given our evidence w/o them controlling the text.

12:07 PM · May 19, 2026 View on X

We also offered the option to approve quotes *without attribution*. Through this mechanism, a company could disclose evidence that speaks to industry-wide issues (examples of reward hacking, lax controls, etc.), which they would individually have incentives not to publish.

Charles FosterCharles Foster@CFGeek

Then we told each company what evidence we wanted to quote in our public report and negotiated any redactions. Our agreement carved out certain info, but generally they could block anything we couldn't have acquired ourselves (like by reading the news or evaluating public APIs).

5:22 PM · May 20, 2026 · 61 Views
5:22 PM · May 20, 2026 · 66 Views

Then we told each company what evidence we wanted to quote in our public report and negotiated any redactions. Our agreement carved out certain info, but generally they could block anything we couldn't have acquired ourselves (like by reading the news or evaluating public APIs).

Charles FosterCharles Foster@CFGeek

As the first step in our process, we ran entity-level assessments for each company. These asked "What's the holistic picture of loss-of-control at [Company] at this point in time?", not "What's the risk from this specific [Company] model launch decision?".

5:22 PM · May 20, 2026 · 78 Views
5:22 PM · May 20, 2026 · 61 Views

As a backstop, we gave ourselves a carved-out right to publish a redaction summary indicator for each company. That sentence would let us flag whether a company insisted on a redaction we felt was material and blocked us from noting the specific redaction. No participant did.

Charles FosterCharles Foster@CFGeek

We also offered the option to approve quotes *without attribution*. Through this mechanism, a company could disclose evidence that speaks to industry-wide issues (examples of reward hacking, lax controls, etc.), which they would individually have incentives not to publish.

5:22 PM · May 20, 2026 · 66 Views
5:26 PM · May 20, 2026 · 25 Views

Companies could exit from the pilot up to the point where they signed off on (redacted and/or anonymized) evidence, but not after. That meant companies knew exactly what non-public evidence we might cite, but couldn't directly control our downstream conclusions, framing, or tone.

Charles FosterCharles Foster@CFGeek

As a backstop, we gave ourselves a carved-out right to publish a redaction summary indicator for each company. That sentence would let us flag whether a company insisted on a redaction we felt was material and blocked us from noting the specific redaction. No participant did.

5:26 PM · May 20, 2026 · 25 Views
5:27 PM · May 20, 2026 · 27 Views

You can find the non-public evidence companies approved in the back of the report. Appendix B covers stuff that companies were OK having attributed to them individually, and Appendix C aggregates statements from across companies. Later appendices also include some CoT excerpts.

Charles FosterCharles Foster@CFGeek

Companies could exit from the pilot up to the point where they signed off on (redacted and/or anonymized) evidence, but not after. That meant companies knew exactly what non-public evidence we might cite, but couldn't directly control our downstream conclusions, framing, or tone.

5:27 PM · May 20, 2026 · 27 Views
5:27 PM · May 20, 2026 · 108 Views

This Frontier Risk Report is also the first time that we've used the AEF-1 standard from @aievalforum. I think it's important for organizations like METR to be transparent and accountable to the public ourselves, not just demand it of AI companies.

Charles FosterCharles Foster@CFGeek

You can find the non-public evidence companies approved in the back of the report. Appendix B covers stuff that companies were OK having attributed to them individually, and Appendix C aggregates statements from across companies. Later appendices also include some CoT excerpts.

5:27 PM · May 20, 2026 · 108 Views
5:27 PM · May 20, 2026 · 93 Views