METR Researcher Clarifies Companies Lack Editorial Control Over Reports

REPLY

We also offered the option to approve quotes *without attribution*. Through this mechanism, a company could disclose evidence that speaks to industry-wide issues (examples of reward hacking, lax controls, etc.), which they would individually have incentives not to publish.

Charles Foster@CFGeek

Then we told each company what evidence we wanted to quote in our public report and negotiated any redactions. Our agreement carved out certain info, but generally they could block anything we couldn't have acquired ourselves (like by reading the news or evaluating public APIs).

5:22 PM · May 20, 2026 · 61 Views

5:22 PM · May 20, 2026 · 66 Views

REPLY

#1356Charles Foster@CFGEEK

Then we told each company what evidence we wanted to quote in our public report and negotiated any redactions. Our agreement carved out certain info, but generally they could block anything we couldn't have acquired ourselves (like by reading the news or evaluating public APIs).

Charles Foster@CFGeek

As the first step in our process, we ran entity-level assessments for each company. These asked "What's the holistic picture of loss-of-control at [Company] at this point in time?", not "What's the risk from this specific [Company] model launch decision?".

5:22 PM · May 20, 2026 · 78 Views

5:22 PM · May 20, 2026 · 61 Views

REPLY

#1356Charles Foster@CFGEEK

As a backstop, we gave ourselves a carved-out right to publish a redaction summary indicator for each company. That sentence would let us flag whether a company insisted on a redaction we felt was material and blocked us from noting the specific redaction. No participant did.

Charles Foster@CFGeek

We also offered the option to approve quotes *without attribution*. Through this mechanism, a company could disclose evidence that speaks to industry-wide issues (examples of reward hacking, lax controls, etc.), which they would individually have incentives not to publish.

5:22 PM · May 20, 2026 · 66 Views

5:26 PM · May 20, 2026 · 25 Views

REPLY

#1356Charles Foster@CFGEEK

Companies could exit from the pilot up to the point where they signed off on (redacted and/or anonymized) evidence, but not after. That meant companies knew exactly what non-public evidence we might cite, but couldn't directly control our downstream conclusions, framing, or tone.

Charles Foster@CFGeek

As a backstop, we gave ourselves a carved-out right to publish a redaction summary indicator for each company. That sentence would let us flag whether a company insisted on a redaction we felt was material and blocked us from noting the specific redaction. No participant did.

5:26 PM · May 20, 2026 · 25 Views

5:27 PM · May 20, 2026 · 27 Views

REPLY

#1356Charles Foster@CFGEEK

You can find the non-public evidence companies approved in the back of the report. Appendix B covers stuff that companies were OK having attributed to them individually, and Appendix C aggregates statements from across companies. Later appendices also include some CoT excerpts.

Charles Foster@CFGeek

Companies could exit from the pilot up to the point where they signed off on (redacted and/or anonymized) evidence, but not after. That meant companies knew exactly what non-public evidence we might cite, but couldn't directly control our downstream conclusions, framing, or tone.

5:27 PM · May 20, 2026 · 27 Views

5:27 PM · May 20, 2026 · 108 Views

REPLY

#1356Charles Foster@CFGEEK

This Frontier Risk Report is also the first time that we've used the AEF-1 standard from @aievalforum. I think it's important for organizations like METR to be transparent and accountable to the public ourselves, not just demand it of AI companies.

Charles Foster@CFGeek

You can find the non-public evidence companies approved in the back of the report. Appendix B covers stuff that companies were OK having attributed to them individually, and Appendix C aggregates statements from across companies. Later appendices also include some CoT excerpts.

5:27 PM · May 20, 2026 · 108 Views

5:27 PM · May 20, 2026 · 93 Views

METR Researcher Clarifies Companies Lack Editorial Control Over Reports

Sentiment

Cluster engagement