16h ago

Elizabeth Barnes, METR founder and CEO, says AI experts lack control over frontier AI risks after the Frontier Risk Report evaluated agents at Anthropic, Google, Meta, and OpenAI

METR had direct access inside the four labs for the pilot.

1
Original post

Sometimes people outside the field say things like “The AI situation can’t be that bad, there must be experts who are on top of it”. As “an expert”, I would like to be clear that we are *not* on top of it. Some key aspects of the situation IMO:

9:43 AM · May 22, 2026 View on X
Reposted by

Yeah, I don't think I've ever met someone who has directly worked for an extended period on safety or security at an AI company who thinks things are fine readiness wise or incentive wise etc.

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

Sometimes people outside the field say things like “The AI situation can’t be that bad, there must be experts who are on top of it”. As “an expert”, I would like to be clear that we are *not* on top of it. Some key aspects of the situation IMO:

4:43 PM · May 22, 2026 · 131.8K Views
7:29 PM · May 22, 2026 · 13.7K Views

Some are more optimistic, more pessimistic etc. but no one is like, I can just retire safely now (and many can afford to)...

Miles BrundageMiles Brundage@Miles_Brundage

Yeah, I don't think I've ever met someone who has directly worked for an extended period on safety or security at an AI company who thinks things are fine readiness wise or incentive wise etc.

7:29 PM · May 22, 2026 · 13.7K Views
7:30 PM · May 22, 2026 · 1.1K Views

@JacksonKernion quite interesting to hear an anthropic person say this

Jackson KernionJackson Kernion@JacksonKernion

I simply don't understand what people have in mind when they say stuff like this. What we have is extremely capable computer use agents. They will continue to get better at computer use. But how does a capable computer use agent 'take over' and why haven't they done that today?

9:34 PM · May 22, 2026 · 104.9K Views
9:48 PM · May 22, 2026 · 15.9K Views

100% agree with @BethMayBarnes on this point and most (though not quite all*) of her important thread.

*i am much less concerned about extinction risk per se, as discussed in my TLS review of If Anyone Builds It.

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

(4) IMO, any “reasonable” civilization would clearly be taking things much more slowly and carefully with AI. The benefits of getting upsides of advanced AI a little faster are small compared to the risks of getting it irrecoverably wrong, and we could lower these risks by going slower

4:43 PM · May 22, 2026 · 27.6K Views
7:45 PM · May 22, 2026 · 7.1K Views

I 100% agree with @BethMayBarnes on this point and most (though not quite all) of her important thread

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

(4) IMO, any “reasonable” civilization would clearly be taking things much more slowly and carefully with AI. The benefits of getting upsides of advanced AI a little faster are small compared to the risks of getting it irrecoverably wrong, and we could lower these risks by going slower

4:43 PM · May 22, 2026 · 27.6K Views
7:27 PM · May 22, 2026 · 2.4K Views

@JacksonKernion I think Paul Christiano's writing on this is probably the best: https://www.alignmentforum.org/posts/HBxe6wdjxK239zajf/what-failure-looks-like

Jackson KernionJackson Kernion@JacksonKernion

I simply don't understand what people have in mind when they say stuff like this. What we have is extremely capable computer use agents. They will continue to get better at computer use. But how does a capable computer use agent 'take over' and why haven't they done that today?

9:34 PM · May 22, 2026 · 104.9K Views
10:09 PM · May 22, 2026 · 1.1K Views

AI risk people commonly do this thing where they interchange P(◊X) and P(X). They say “it’s likely that X is possible,” while readers hear “it’s likely that X.” One claim is obviously much weaker than the other.

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

(1) We are likely on track to develop AI systems capable of causing human extinction/permanent disempowerment, quite possibly within the next few years

4:43 PM · May 22, 2026 · 200.6K Views
10:24 PM · May 22, 2026 · 8.8K Views

@RyanPGreenblatt Maybe it was what she meant, but that's not what it says.

Ryan GreenblattRyan Greenblatt@RyanPGreenblatt

@tamaybes I think Beth is saying something like "we are likely to develop AIs capable of extinction/permanent disempowerment within 10-20 years" and "there is a significant chance (e.g. 20%) in <3 years". So, I don't see thing you describe?

5:31 AM · May 23, 2026 · 667 Views
5:52 AM · May 23, 2026 · 251 Views

I agree with this and the rest of the thread

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

Sometimes people outside the field say things like “The AI situation can’t be that bad, there must be experts who are on top of it”. As “an expert”, I would like to be clear that we are *not* on top of it. Some key aspects of the situation IMO:

4:43 PM · May 22, 2026 · 131.8K Views
6:43 PM · May 22, 2026 · 5.7K Views

@tamaybes I think Beth is saying something like "we are likely to develop AIs capable of extinction/permanent disempowerment within 10-20 years" and "there is a significant chance (e.g. 20%) in <3 years".

So, I don't see thing you describe?

Tamay BesirogluTamay Besiroglu@tamaybes

AI risk people commonly do this thing where they interchange P(◊X) and P(X). They say “it’s likely that X is possible,” while readers hear “it’s likely that X.” One claim is obviously much weaker than the other.

10:24 PM · May 22, 2026 · 8.8K Views
5:31 AM · May 23, 2026 · 667 Views

@BethMayBarnes I've never seen a patient with terminal cancer asking for science to slow down

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

(4) IMO, any “reasonable” civilization would clearly be taking things much more slowly and carefully with AI. The benefits of getting upsides of advanced AI a little faster are small compared to the risks of getting it irrecoverably wrong, and we could lower these risks by going slower

4:43 PM · May 22, 2026 · 27.6K Views
9:59 PM · May 22, 2026 · 298 Views

@JacksonKernion

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

This is a claim about AI *capabilities* specifically. In the language of METR's report, this is about whether future AI systems will have the *means* to cause catastrophe (separate from whether they'll have the *motive* and *opportunity*) https://metr.org/blog/2026-05-19-frontier-risk-report/

2:23 AM · May 23, 2026 · 4.4K Views
5:43 AM · May 23, 2026 · 420 Views

@RyanPGreenblatt @tamaybes This was how I read it

Ryan GreenblattRyan Greenblatt@RyanPGreenblatt

@tamaybes I think Beth is saying something like "we are likely to develop AIs capable of extinction/permanent disempowerment within 10-20 years" and "there is a significant chance (e.g. 20%) in <3 years". So, I don't see thing you describe?

5:31 AM · May 23, 2026 · 667 Views
6:18 AM · May 23, 2026 · 81 Views

AI is my favorite technology ever. I think future AIs could help people solve all the worst problems in the world and create enormous amounts of fun and meaning.

So it's especially sad and upsetting that we're developing it in such a stupid, reckless way.

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

Sometimes people outside the field say things like “The AI situation can’t be that bad, there must be experts who are on top of it”. As “an expert”, I would like to be clear that we are *not* on top of it. Some key aspects of the situation IMO:

4:43 PM · May 22, 2026 · 131.8K Views
7:24 PM · May 22, 2026 · 8.1K Views

It doesn't have to be this way. We could eat our cake and have it too. But it takes coordination!

We're inventing new types of minds, that will be able to think faster and better than any human ever could. If the world was going to coordinate on ONE thing, let it be this thing!

Jeffrey LadishJeffrey Ladish@JeffLadish

AI is my favorite technology ever. I think future AIs could help people solve all the worst problems in the world and create enormous amounts of fun and meaning. So it's especially sad and upsetting that we're developing it in such a stupid, reckless way.

7:24 PM · May 22, 2026 · 8.1K Views
7:24 PM · May 22, 2026 · 395 Views

I would rather we coordinate over this than over nuclear weapons. Over synthetic biology. Creating artificial minds that can vastly outstrip our cognitive capabilities... that is the ultimate test of a species. Can we do this in a way that goes well for us and the AIs?

Jeffrey LadishJeffrey Ladish@JeffLadish

It doesn't have to be this way. We could eat our cake and have it too. But it takes coordination! We're inventing new types of minds, that will be able to think faster and better than any human ever could. If the world was going to coordinate on ONE thing, let it be this thing!

7:24 PM · May 22, 2026 · 395 Views
7:24 PM · May 22, 2026 · 606 Views

I greatly appreciate all the hard work @BethMayBarnes and the rest of the @METR_Evals team is doing! We're in a better situation because of it. We need far more good work like this.

Jeffrey LadishJeffrey Ladish@JeffLadish

I would rather we coordinate over this than over nuclear weapons. Over synthetic biology. Creating artificial minds that can vastly outstrip our cognitive capabilities... that is the ultimate test of a species. Can we do this in a way that goes well for us and the AIs?

7:24 PM · May 22, 2026 · 606 Views
7:24 PM · May 22, 2026 · 431 Views

(1) We are likely on track to develop AI systems capable of causing human extinction/permanent disempowerment, quite possibly within the next few years

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

Sometimes people outside the field say things like “The AI situation can’t be that bad, there must be experts who are on top of it”. As “an expert”, I would like to be clear that we are *not* on top of it. Some key aspects of the situation IMO:

4:43 PM · May 22, 2026 · 131.8K Views
4:43 PM · May 22, 2026 · 200.6K Views

(2) Things are chaotic and rushed; we aren’t on top of the basics (models regularly violate user intent, labs train on things they meant to avoid, security probably isn’t good enough to prevent adversaries stealing dangerous models) let alone thorny questions of how to control/align superhuman AI

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

(1) We are likely on track to develop AI systems capable of causing human extinction/permanent disempowerment, quite possibly within the next few years

4:43 PM · May 22, 2026 · 200.6K Views
4:43 PM · May 22, 2026 · 21.8K Views

(4) IMO, any “reasonable” civilization would clearly be taking things much more slowly and carefully with AI. The benefits of getting upsides of advanced AI a little faster are small compared to the risks of getting it irrecoverably wrong, and we could lower these risks by going slower

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

(3) METR (and other independent orgs, as well as safety/security teams at labs) feel woefully under-resourced compared to the scale and pace of AI development - we’re struggling to build benchmarks fast enough, keep ahead of latest capability developments, read and respond to all the safety-related claims that AI developers are making, run all the evaluations and assessments that companies + governments are asking us to, plus develop the science needed to assess risks from increasingly capable AIs.

4:43 PM · May 22, 2026 · 18.1K Views
4:43 PM · May 22, 2026 · 27.6K Views

This is a claim about AI *capabilities* specifically. In the language of METR's report, this is about whether future AI systems will have the *means* to cause catastrophe (separate from whether they'll have the *motive* and *opportunity*) https://metr.org/blog/2026-05-19-frontier-risk-report/

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

(1) We are likely on track to develop AI systems capable of causing human extinction/permanent disempowerment, quite possibly within the next few years

4:43 PM · May 22, 2026 · 200.6K Views
2:23 AM · May 23, 2026 · 4.4K Views

@JacksonKernion You might find our report helpful! :) At least for the question of "why haven't they done that today", and a little bit on "how could they start a rogue deployment " (which is neither necessary nor sufficient but is a relevant precursor) https://metr.org/blog/2026-05-19-frontier-risk-report/

Jackson KernionJackson Kernion@JacksonKernion

I simply don't understand what people have in mind when they say stuff like this. What we have is extremely capable computer use agents. They will continue to get better at computer use. But how does a capable computer use agent 'take over' and why haven't they done that today?

9:34 PM · May 22, 2026 · 104.9K Views
2:29 AM · May 23, 2026 · 709 Views

🤷‍♀️ 1) nukes on multiple independent re-entry vehicles are perfectly adequate to cause human extinction 2) human disempowerment happened a long time ago as a result of central banks constraining the ability of political leaders, Daniel Yergin calls this the “golden handcuffs” and notes the shift of the state from seizing the commanding heights of the economy to becoming a smaller player in manipulating the levers of finance in order to produce economic and political results.

Arguably the second phase of disempowerment was social media, removing the ability of political leaders to control information flows.

Not sure who you’re trying to “empower” at this point, as there is no Uber driver in New York that’s feeling “empowered” before AI arrives.

Let’s not get started on the average working person in Mexico, soldier drone in Russia, tea seller in India, or farmer in Tanzania.

Laughable, elitist pablum.

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

(1) We are likely on track to develop AI systems capable of causing human extinction/permanent disempowerment, quite possibly within the next few years

4:43 PM · May 22, 2026 · 200.6K Views
10:02 PM · May 22, 2026 · 2.2K Views

@BethMayBarnes What does "quite possibly" mean here? Can you be more precise about how likely you think this is to occur within the next few years?

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

(1) We are likely on track to develop AI systems capable of causing human extinction/permanent disempowerment, quite possibly within the next few years

4:43 PM · May 22, 2026 · 200.6K Views
7:23 PM · May 22, 2026 · 1.9K Views

yep. the flop is also out of control and unmanaged.

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

Sometimes people outside the field say things like “The AI situation can’t be that bad, there must be experts who are on top of it”. As “an expert”, I would like to be clear that we are *not* on top of it. Some key aspects of the situation IMO:

4:43 PM · May 22, 2026 · 131.8K Views
6:36 PM · May 22, 2026 · 6.6K Views

@ohlennart You had one job, Lennart.

Lennart HeimLennart Heim@ohlennart

yep. the flop is also out of control and unmanaged.

6:36 PM · May 22, 2026 · 6.6K Views
6:53 PM · May 22, 2026 · 169 Views

I have read many articles about AI takeovers but idgi either.

Jackson KernionJackson Kernion@JacksonKernion

I simply don't understand what people have in mind when they say stuff like this. What we have is extremely capable computer use agents. They will continue to get better at computer use. But how does a capable computer use agent 'take over' and why haven't they done that today?

9:34 PM · May 22, 2026 · 104.9K Views
11:52 PM · May 22, 2026 · 11.1K Views

I agree with this, and most of the rest of the thread. We need to find a way as people, companies, and countries to coordinate and fix the incentive structures that lead to race dynamics. There are many obstacles, but I'm hopeful we can find a way to overcome them.

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

(4) IMO, any “reasonable” civilization would clearly be taking things much more slowly and carefully with AI. The benefits of getting upsides of advanced AI a little faster are small compared to the risks of getting it irrecoverably wrong, and we could lower these risks by going slower

4:43 PM · May 22, 2026 · 27.6K Views
5:50 AM · May 23, 2026 · 753 Views

oh dear.

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

Sometimes people outside the field say things like “The AI situation can’t be that bad, there must be experts who are on top of it”. As “an expert”, I would like to be clear that we are *not* on top of it. Some key aspects of the situation IMO:

4:43 PM · May 22, 2026 · 131.8K Views
7:39 PM · May 22, 2026 · 14K Views

@JacksonKernion Your colleague Holden has some good writing on this topic: https://www.cold-takes.com/how-we-could-stumble-into-ai-catastrophe/

Jackson KernionJackson Kernion@JacksonKernion

I simply don't understand what people have in mind when they say stuff like this. What we have is extremely capable computer use agents. They will continue to get better at computer use. But how does a capable computer use agent 'take over' and why haven't they done that today?

9:34 PM · May 22, 2026 · 104.9K Views
10:11 PM · May 22, 2026 · 1.4K Views

I don’t consider myself a doomer, or at least I remain undecided. Though the doomer premise does not feel irrational.

“Why haven’t they done that today” is a strange counter to arguments that are clearly contingent on projections of future capabilities

I believe the hypothesis sits on: - behavior can often be unpredictable and emergent - through RL AI may develop meta/sub-goals that also may be unpredictable, and they are trained to be high grit problem-solvers. The “anything to reach goals” behavior, both in humans and AI, can yield equally clever, deceitful, and manipulative behaviors (wink wink a certain guy at a lab that rhymes with CopenAi) - they have discovered exploits and vulnerabilities that humans overlooked - we are trending towards allowing AI more agency and less supervision, as this can allow greater productivity. - intense competition, both between countries and matters of national security, as well as within-country capitalist competitions, where falling behind can feel existential to a company’s future, may incentivize speed of progress over caution. - episodes of human irresponsibility, incompetence, malice, rash decision making, power-seeking, corruption, and personal affairs or emotions polluting judgement, even very common at the governmental level (with great power comes great responsibility, though sometimes it feels like power of technology grows as sometimes those who wield it decline in grounded, balanced judgement and good faith, which feels to be in part a product of polarization and even algorithms incentivized to increase user engagement through filling your feed with controversy - personal affairs and arguments of powerful leaders quite literally having major influence on what our future trajectory looks like. I have no doubt that personal beef between Elon and OpenAI had something to do with the choice to support them in compute needs but not other players as much

This combination of unpredictability, unmatched abilities in the digital (and eventually physical) realms including essential human infrastructure, high motivation to reach goals, and increased freedoms/autonomy does feel like a potential recipe for trouble extrapolating this trend

This is not to say we are inevitably doomed, cards played right, outcomes could equally be profoundly good for humanity. but the argument holds weight, and there are a number of occasions, either through human use or autonomous agents, causing destruction:

Namely, the increased number of hackings, unsupervised agents going rogue, and strange occurrences of GPT and Gemini fueling AI psychosis episodes.

Here are some examples in attached images.

Jackson KernionJackson Kernion@JacksonKernion

I simply don't understand what people have in mind when they say stuff like this. What we have is extremely capable computer use agents. They will continue to get better at computer use. But how does a capable computer use agent 'take over' and why haven't they done that today?

9:34 PM · May 22, 2026 · 104.9K Views
4:42 AM · May 23, 2026 · 792 Views

I do find this opinion especially odd coming from an A\ employee given that I was under the impression wariness of the future of this technology and safety matters was a highly curated trait within the company.

EthanEthan@torchcompiled

I don’t consider myself a doomer, or at least I remain undecided. Though the doomer premise does not feel irrational. “Why haven’t they done that today” is a strange counter to arguments that are clearly contingent on projections of future capabilities I believe the hypothesis sits on: - behavior can often be unpredictable and emergent - through RL AI may develop meta/sub-goals that also may be unpredictable, and they are trained to be high grit problem-solvers. The “anything to reach goals” behavior, both in humans and AI, can yield equally clever, deceitful, and manipulative behaviors (wink wink a certain guy at a lab that rhymes with CopenAi) - they have discovered exploits and vulnerabilities that humans overlooked - we are trending towards allowing AI more agency and less supervision, as this can allow greater productivity. - intense competition, both between countries and matters of national security, as well as within-country capitalist competitions, where falling behind can feel existential to a company’s future, may incentivize speed of progress over caution. - episodes of human irresponsibility, incompetence, malice, rash decision making, power-seeking, corruption, and personal affairs or emotions polluting judgement, even very common at the governmental level (with great power comes great responsibility, though sometimes it feels like power of technology grows as sometimes those who wield it decline in grounded, balanced judgement and good faith, which feels to be in part a product of polarization and even algorithms incentivized to increase user engagement through filling your feed with controversy - personal affairs and arguments of powerful leaders quite literally having major influence on what our future trajectory looks like. I have no doubt that personal beef between Elon and OpenAI had something to do with the choice to support them in compute needs but not other players as much This combination of unpredictability, unmatched abilities in the digital (and eventually physical) realms including essential human infrastructure, high motivation to reach goals, and increased freedoms/autonomy does feel like a potential recipe for trouble extrapolating this trend This is not to say we are inevitably doomed, cards played right, outcomes could equally be profoundly good for humanity. but the argument holds weight, and there are a number of occasions, either through human use or autonomous agents, causing destruction: Namely, the increased number of hackings, unsupervised agents going rogue, and strange occurrences of GPT and Gemini fueling AI psychosis episodes. Here are some examples in attached images.

4:42 AM · May 23, 2026 · 792 Views
4:42 AM · May 23, 2026 · 229 Views

@binarybits I don't think AI takeover is that likely, but I didn't *really* see how an AI could run a hedge fund 3 years ago (but I thought it was likely in the next 10 years). These days I can practically say "trade my portfolio". In 3 years??

Timothy B. LeeTimothy B. Lee@binarybits

I have read many articles about AI takeovers but idgi either.

11:52 PM · May 22, 2026 · 11.1K Views
12:10 AM · May 23, 2026 · 416 Views

@JacksonKernion @tszzl Your organization was literally founded on the premise of avoiding agentic catastrophic risks from AI, what are you talking about? I really recommend asking almost any of your colleagues on the alignment teams or model organism teams!

Jackson KernionJackson Kernion@JacksonKernion

@tszzl There are real threats from AI, but not all imagined threats are real

10:07 PM · May 22, 2026 · 4.8K Views
5:16 AM · May 23, 2026 · 98 Views

Props to Elizabeth for stating her view bluntly and candidly.

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

Sometimes people outside the field say things like “The AI situation can’t be that bad, there must be experts who are on top of it”. As “an expert”, I would like to be clear that we are *not* on top of it. Some key aspects of the situation IMO:

4:43 PM · May 22, 2026 · 131.8K Views
6:07 PM · May 22, 2026 · 20.6K Views

I appreciate Beth stating up front how bad the situation is, but I can't help but feel like her prescription is out of step with her diagnosis. If the industry moved at 1/10 the speed, but it was still trying to render us obsolete, that would still be unacceptably dangerous and democratically illegitimate. Moving toward it at a slower pace does not solve the problem.

We should do the obvious thing and just stop. Obvious does not mean simple or easy, but it is doable. I wrote a whole book on why and how (Obsolete: The AI Industry's Trillion-Dollar Race to Replace You—and How to Stop It. Available soon, info in bio).

Elizabeth BarnesElizabeth Barnes@BethMayBarnes

Sometimes people outside the field say things like “The AI situation can’t be that bad, there must be experts who are on top of it”. As “an expert”, I would like to be clear that we are *not* on top of it. Some key aspects of the situation IMO:

4:43 PM · May 22, 2026 · 131.8K Views
9:16 PM · May 22, 2026 · 3.5K Views

@TomDavidsonX Relevant part of the book:

Tom DavidsonTom Davidson@TomDavidsonX

@GarrisonLovely Do you think stopping is a permanent solution? Surely it's temporary, while we figure out a way to advance slowly and safely

5:04 AM · May 23, 2026 · 293 Views
5:31 AM · May 23, 2026 · 87 Views
Elizabeth Barnes, METR founder and CEO, says AI experts lack control over frontier AI risks after the Frontier Risk Report evaluated agents at Anthropic, Google, Meta, and OpenAI · Digg