Elizabeth Barnes, METR founder and CEO, outlined two possible readings of ambiguous AI capability terminology distinguishing near-term potential from realized task outcomes

REPLY

@tamaybes @RyanPGreenblatt I realized 2 is a grammatical ambiguity, I didn't intend the parsing of "develop AI capable of [possibly causing z within a few years]", meant "possibly develop [AI capable of causing z] within a few years"

Elizabeth Barnes@BethMayBarnes

@tamaybes @RyanPGreenblatt Genuinely unsure what you mean here. Some guesses: 1. ◊X ="AI capable of z" and X = "AI that will in fact succeed at doing z" (or "z will happen")? 2. ◊X = "AI capable of [causing z within a few years]", X="AI capable of z"?

7:31 PM · May 23, 2026 · 44 Views

7:33 PM · May 23, 2026 · 29 Views

REPLY

#1325Elizabeth Barnes@BETHMAYBARNES

@MatthewJBar @BronsonSchoen @tamaybes @GuiveAssadi It seems very reasonable to me to think capability progress this rapid is pretty unlikely (e.g. <15% or something) but I'd find it hard to be very confident it's not possible (e.g. <1%) - curious how confident you are here?

7:43 PM · May 23, 2026 · 96 Views

REPLY

#1325Elizabeth Barnes@BETHMAYBARNES

@MatthewJBar @BronsonSchoen @tamaybes @GuiveAssadi Ah, I'm not sure we actually disagree a ton on substance then - maybe just a communication issue? "quite possibly" is pretty compatible with 5% in my mind. Is this a "brits and americans use 'quite' differently" thing maybe?

Matthew Barnett@MatthewJBar

@BethMayBarnes @BronsonSchoen @tamaybes @GuiveAssadi I think it's logically possible but very unlikely (<5%).

7:59 PM · May 23, 2026 · 71 Views

8:04 PM · May 23, 2026 · 80 Views

REPLY

#1325Elizabeth Barnes@BETHMAYBARNES

@MatthewJBar @BronsonSchoen @tamaybes @GuiveAssadi Or maybe a combination of somewhat different view on risk level, and different sense of what our obligations in public communication are? Ie level of concern about "alarmism" vs "failure to disclose risk"?

Elizabeth Barnes@BethMayBarnes

@MatthewJBar @BronsonSchoen @tamaybes @GuiveAssadi Ah, I'm not sure we actually disagree a ton on substance then - maybe just a communication issue? "quite possibly" is pretty compatible with 5% in my mind. Is this a "brits and americans use 'quite' differently" thing maybe?

8:04 PM · May 23, 2026 · 80 Views

8:09 PM · May 23, 2026 · 27 Views

REPLY

#1325Elizabeth Barnes@BETHMAYBARNES

@MatthewJBar @BronsonSchoen @tamaybes @GuiveAssadi Agree this matters a lot! Although seems to me that it's still useful to know that it's not <1% and not >90%, and get everyone on the same page about that, and take some of the actions that make sense if all you know is you're in that range.

Matthew Barnett@MatthewJBar

@BethMayBarnes @BronsonSchoen @tamaybes @GuiveAssadi I think in these contexts it's important to communicate clearly and precisely. It matters a lot whether AI systems are 5% vs. 75% likely to be capable of making humanity go extinct within 3 years.

8:23 PM · May 23, 2026 · 70 Views

8:39 PM · May 23, 2026 · 42 Views

REPLY

#1343Matthew Barnett@MATTHEWJBAR

@BethMayBarnes @BronsonSchoen @tamaybes @GuiveAssadi I think it's logically possible but very unlikely (<5%).

Elizabeth Barnes@BethMayBarnes

@MatthewJBar @BronsonSchoen @tamaybes @GuiveAssadi It seems very reasonable to me to think capability progress this rapid is pretty unlikely (e.g. <15% or something) but I'd find it hard to be very confident it's not possible (e.g. <1%) - curious how confident you are here?

7:43 PM · May 23, 2026 · 96 Views

7:59 PM · May 23, 2026 · 71 Views

REPLY

#1343Matthew Barnett@MATTHEWJBAR

@BethMayBarnes @BronsonSchoen @tamaybes @GuiveAssadi I think in these contexts it's important to communicate clearly and precisely. It matters a lot whether AI systems are 5% vs. 75% likely to be capable of making humanity go extinct within 3 years.

Elizabeth Barnes@BethMayBarnes

@MatthewJBar @BronsonSchoen @tamaybes @GuiveAssadi I dunno, seems quite possible that "fairly likely" is compatible with 5% :P (I feel a bit torn about how much to use explicit probabilities; obviously desirable to avoid miscommunications, but can feel unrepresentative of actual epistemic state)

8:19 PM · May 23, 2026 · 54 Views

8:23 PM · May 23, 2026 · 70 Views

Elizabeth Barnes, METR founder and CEO, outlined two possible readings of ambiguous AI capability terminology distinguishing near-term potential from realized task outcomes

Sentiment

Cluster engagement