Tal Linzen, associate professor at NYU and research scientist at Google, discusses on the Information Bottleneck podcast why children acquire language from roughly 100 million words while large language models require trillions of tokens


AI Judge changed title after evaluation, original title: "Tal Linzen, associate professor at NYU and research scientist at Google, notes that children learn language from roughly 100 million words while current LLMs require trillions in the latest Information Bottleneck podcast episode"

As language models get better at predicting the next word, they become worse models of human language processing.


Original post

Ravid Shwartz Ziv (@ziv_ravid)

New episode of The Information Bottleneck is out with @tallinzen, Associate Professor at NYU and Research Scientist at Google. Tal works at the intersection of cognitive science and language models, and he's one of the clearest voices on what humans and LLMs can actually teach us about each other. We talked about why children learn language from 100M words while LLMs need trillions, the surprising finding that as models get better at predicting the next word they become worse models of humans, inductive biases and synthetic languages, world models and whether transformers actually use them, BabyLM, and how AI coding tools are changing the way he teaches at NYU. I'm sure you will enjoy it!

11:32 AM · May 17, 2026 · 6.6K Views


Related links

Language, Cognition, and the Limits of LLMs - with Tal Linzen (N…

Via The Information Bottleneck

Posts from X


Tal Linzen (@tallinzen)

This was a super fun conversation, thanks for having me on the podcast!


Replies (1)

Ravid Shwartz Ziv (@ziv_ravid)

The episode - https://www.the-information-bottleneck.com/language-cognition-and-the-limits-of-llms/
