/Tech1d ago

Dwarkesh Patel argues that deep learning won because computational logic costs fell faster than data transfer costs

Dustin Tran says cheap data transfer favors graphical models.

243612526375.4K

#60

Original post

Dwarkesh Patel@dwarkesh_sp#60inTech

How many of the big ideas of the past 15 years of AI are downstream of hardware constraints?

The big hardware story over that period is that logic has become way cheaper than data transfer.

Stacking huge numbers of matrix multiplies was perfect for this hardware regime, because matrix multiplication is logic-intensive but requires less data transfer. And so we got matmul-heavy deep learning.

It's interesting to think about what AI would look like in a world where these costs didn't diverge so much.

2:24 PM · Jun 24, 2026 · 53.5K Views

Sentiment

Users appreciate the framing that hardware costs drove deep learning toward matmul architectures over graphical models because it clarifies why message passing never competed once compute scaled.

Pos

100.0%

Neg

0.0%

1 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

DWARKESH.COMVia

#60

Posts from X

Most Activity

VIEWS14.8KBOOKMARKS39LIKES46RETWEETS4REPLIES2

Dustin Tran@dustinvtran

How would AI have changed in a counterfactual world where data transfer is cheap and logic is expensive? There's an easy answer imo: transformers + sgd → graphical models + message passing

Dwarkesh Patel@dwarkesh_sp

How many of the big ideas of the past 15 years of AI are downstream of hardware constraints?

The big hardware story over that period is that logic has become way cheaper than data transfer.

It's interesting to think about what AI would look like in a world where these costs didn't diverge so much.

1d14.8K4639

Dwarkesh Patel@dwarkesh_sp

Full episode with Noam Shazeer and Jeff Dean: https://www.dwarkesh.com/p/jeff-dean-and-noam-shazeer

Dwarkesh Patel@dwarkesh_sp

How many of the big ideas of the past 15 years of AI are downstream of hardware constraints?

The big hardware story over that period is that logic has become way cheaper than data transfer.

It's interesting to think about what AI would look like in a world where these costs didn't diverge so much.

1d7.3K2920

Hunter Gon@gonlenidefi

@dustinvtran this framing makes the robbins path vs bayesian thing click in a new way

message passing never stood a chance once compute favored dense matmuls

1d27