Tech · 27d ago

Microsoft AI CEO Mustafa Suleyman predicts AI training compute will scale by three orders of magnitude over the next three years

This would push frontier model training to 5e30 FLOPs.

Original post

Andrew Curran @AndrewCurran_ · #682 in Tech

At Microsoft Build today, Mustafa Suleyman predicted three more OOM jumps in the amount of training compute between now and summer 2029.

12:32 PM · Jun 2, 2026 · 18.3K Views

Sentiment

Users are optimistic about Mustafa Suleyman's prediction of three more orders of magnitude in AI compute by 2029 because it could enable AGI and accelerate progress beyond linear forecasts.

Positive: 100.0% · Negative: 0.0%

3 comments with sentiment.

Posts from X

Views 29.8K · Bookmarks 56 · Likes 332 · Retweets 24 · Replies 12

Lisan al Gaib @scaling01

Mustafa Suleyman, CEO of Microsoft AI, says that AI compute will grow 1000x in the next 3 years.

We are currently at around 5e27 FLOPs. Three more OOMs mean we reach 5e30 FLOPs in 2029.

27d · 29.8K views · 332 likes · 56 bookmarks
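The order-of-magnitude arithmetic in the post above can be checked in a few lines. This is only a sketch: the 5e27 FLOPs baseline is the post's own estimate, not a verified figure, and the names `BASELINE_FLOPS` and `scale_by_ooms` are made up for illustration.

```python
import math

# Assumption taken from the post: a frontier training run today uses ~5e27 FLOPs.
BASELINE_FLOPS = 5e27
# Three orders of magnitude (OOMs), i.e. a factor of 10**3 = 1000.
OOM_JUMPS = 3


def scale_by_ooms(baseline: float, ooms: int) -> float:
    """Scale a compute budget up by a whole number of orders of magnitude."""
    return baseline * 10 ** ooms


projected = scale_by_ooms(BASELINE_FLOPS, OOM_JUMPS)

# Print the projection in scientific notation, plus the implied multiplier.
print(f"projected training compute: {projected:.0e} FLOPs")
print(f"multiplier: {projected / BASELINE_FLOPS:.0f}x")
```

Under that assumed baseline, the result matches the 5e30 FLOPs figure quoted in the post.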

Andrew Curran @AndrewCurran_

At Microsoft Build today, Mustafa Suleyman predicted three more OOM jumps in the amount of training compute between now and summer 2029.

27d2.3K171

Chris Paxton @chris_j_paxton

Before agents, I think this likely would not have mattered (not enough data in the world), but with RL environments driving much of the progress these days, I think OOM leaps in training compute will make much more complex computer-use tasks feasible.

Andrew Curran @AndrewCurran_

At Microsoft Build today, Mustafa Suleyman predicted three more OOM jumps in the amount of training compute between now and summer 2029.

27d2.3K170

Lisan al Gaib @scaling01

https://www.youtube.com/watch?v=HG0twQJ7aG4

27d3.4K1

Bojan Tunguz @tunguz

Andrew Curran @AndrewCurran_

At Microsoft Build today, Mustafa Suleyman predicted three more OOM jumps in the amount of training compute between now and summer 2029.

27d3K80

Josh You @justjoshinyou13

@AndrewCurran_ Does he mean relative to the frontier today?

If the baseline for that is MAI-Thinking-1, that's actually not ambitious at all.

27d925

Andrew Curran @AndrewCurran_

@justjoshinyou13 He means relative to total compute globally. He also said that MAI-1 is only 35B.

27d714

Shman @TheShmanuel

@AndrewCurran_ One a year? That seems very optimistic. We are only going one OOM higher than GPT-4 now, 3 years later.

27d271

haro @harobuilds

@scaling01 three orders of magnitude is a slide. the question is whether the software consuming that compute is doing anything worth the electricity bill

27d1043

Shman @TheShmanuel

@AndrewCurran_ @justjoshinyou13 Active I assume

27d31

Scott @scottstts

@AndrewCurran_ Did he say how many white collar jobs will be wiped again on stage?

27d562

cqk @cqkten

@AndrewCurran_ 😵‍💫

27d332

Eclipse 🌖 @ECLresearch

@scaling01 1000x in 3 years implies a ~40% MoM compound, feasible only if H100-scale clusters double every 4 months and inference becomes the dominant load. Would be interested in the split between training vs inference in that projection.

27d841
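A quick sanity check on the compounding figures in the reply above, as a sketch only. It assumes smooth compounding over exactly 36 months, and the function names are hypothetical. On that assumption, 1000x over three years works out to roughly 21% month over month (a doubling about every 3.6 months) rather than ~40%, while doubling every 4 months yields 512x.

```python
import math

GROWTH_FACTOR = 1000.0  # "1000x", as quoted in the thread
MONTHS = 36             # three years, assuming smooth month-by-month compounding


def monthly_rate(factor: float, months: int) -> float:
    """Constant month-over-month growth rate that compounds to `factor`."""
    return factor ** (1.0 / months) - 1.0


def doubling_time_months(factor: float, months: int) -> float:
    """Months per doubling needed to reach `factor` within `months`."""
    return months / math.log2(factor)


def growth_from_doubling(doubling_months: float, months: int) -> float:
    """Total growth if capacity doubles every `doubling_months` months."""
    return 2.0 ** (months / doubling_months)


print(f"implied MoM rate: {monthly_rate(GROWTH_FACTOR, MONTHS):.1%}")
print(f"implied doubling time: {doubling_time_months(GROWTH_FACTOR, MONTHS):.2f} months")
print(f"growth if doubling every 4 months: {growth_from_doubling(4, MONTHS):.0f}x")
print(f"growth at 40% MoM: {1.4 ** MONTHS:.2e}x")
```

The last line shows why the 40% figure does not fit: compounded over 36 months it gives a factor far larger than 1000x.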

Michael R Dawley Jr @mrdj1968

@AndrewCurran_ Demis just modified his AGI speculation to match Kurzweil’s 2029 ... this would fit ...

27d351

Gana @Gana_L_

@scaling01 I just hope most of it will be pre-training instead of RL hill climbing

27d100

mech_eng @RandomCSFan1

@scaling01 Let’s gooooooooo!

27d67

surreal intelligence @Surreal_Intel

@AndrewCurran_ Three more OOM jumps by 2029 is not a forecast. It is an instruction to every institution watching the curve.

Assume today’s impossible becomes procurement, policy and product planning before the decade is out.

27d181

Neuralease @neuralease

@AndrewCurran_ I bet one more OOM will be more than enough for RSI and AGI.

At 3 OOM it'd have similar computational complexity to the human brain and possibly have emergent properties that even we don't.

27d171

retto @rettooooo

@scaling01 feeling the flops

27d25

Tiago Rama @tiagobuilds

@scaling01 If that compute curve holds, most AI predictions are probably too linear.

The gap between demo and default can close fast.

27d20