
Sergey Levine argues LLMs develop emergent capabilities by composing simpler skills in novel ways instead of imitating training data

A paper co-authored by Anirudh Goyal offers a mathematical framework for this view.

Original post

We pre-train LLMs on the whole of the internet. You might think this explains how they learn so many emergent capabilities: the knowledge is implicit in the training data. But in fact models can do things that were never demonstrated anywhere in training! @svlevine argues that the real source of emergent capabilities is compositionality:

11:58 AM · May 30, 2026

Check out the full interview with one of the top robotics researchers: https://www.dwarkesh.com/p/sergey-levine

Dwarkesh Patel @dwarkesh_sp

We pre-train LLMs on the whole of the internet. You might think this explains how they learn so many emergent capabilities: the knowledge is implicit in the training data. But in fact models can do things that were never demonstrated anywhere in training! @svlevine argues that the real source of emergent capabilities is compositionality:

6:58 PM · May 30, 2026 · 16.6K Views
6:58 PM · May 30, 2026 · 4.8K Views

@dwarkesh_sp

This is the phenomenon our paper (with @prfsanjeevarora) tried to formalize: as models scale, basic skills can compose into complex skills.

That gives a theory for emergence beyond direct imitation of training data.

arxiv.org
A Theory for Emergence of Complex Skills in Language Models
A major driver of AI products today is the fact that new skills emerge in language models when their parameter set and training corpora are scaled up. This phenomenon is poorly understood, and a...
7:27 PM · May 30, 2026 · 309 Views

I'm willing to believe this is true, but I also think people who make statements like this haven't seen the amount of crazy shit that is actually on the internet. It might be very hard for you to track down, but there is almost always an example for anything.

8:27 PM · May 30, 2026 · 39 Views