1d ago

Terence Tao Highlights Simple Math Behind LLMs and Unpredictable Performance

922.9K5132.2K457.9K

——0——

Original post

Rohan Paul#1032@ROHANPAUL_AI

https://www.youtube.com/watch?v=ukpCHo5v-Gc

5:28 AM · May 16, 2026

Cluster engagement

144 snapshots

Reposted by

#1032@ROHANPAUL_AI

#1032Rohan Paul@ROHANPAUL_AI

youtube.com

Terence Tao: Nobody Understands Why AI Actually Works

Terence Tao says AI can produce perfect-looking answers that are fundamentally wrong — and most people have no way to detect it. The skill that matters now i...

Rohan Paul@rohanpaul_ai

Terence Tao says the math behind today’s LLMs is actually simple. Training and running them mostly uses linear algebra, matrix multiplication, and a bit of calculus, material an undergraduate can handle. We understand how to build and operate these models. The real mystery is why they work so well on some tasks and fail on others, and why we cannot predict that in advance. We lack good rules for forecasting performance across tasks, so progress is largely empirical. A key reason is the nature of real-world data. Pure noise is well understood, perfectly structured data is well understood, but natural text sits in between, partly structured and partly random. Mathematics for that middle regime is thin, similar to how physics struggles at meso-scales between atoms and continua. Because of this gap, we can describe the mechanisms but cannot yet explain capability jumps or give reliable task-level predictions. That mismatch, simple machinery versus hard-to-predict behavior, is the core puzzle. ---- Video from 'Dr Brian Keating' YT Channel (Link in comment)

12:28 PM · May 16, 2026 · 443.3K Views

12:28 PM · May 16, 2026 · 14.7K Views

ORIGINAL POST

#1032Rohan Paul@ROHANPAUL_AI

The real mystery is why they work so well on some tasks and fail on others, and why we cannot predict that in advance. We lack good rules for forecasting performance across tasks, so progress is largely empirical.

A key reason is the nature of real-world data. Pure noise is well understood, perfectly structured data is well understood, but natural text sits in between, partly structured and partly random. Mathematics for that middle regime is thin, similar to how physics struggles at meso-scales between atoms and continua.

Because of this gap, we can describe the mechanisms but cannot yet explain capability jumps or give reliable task-level predictions. That mismatch, simple machinery versus hard-to-predict behavior, is the core puzzle.

----

Video from 'Dr Brian Keating' YT Channel (Link in comment)

12:28 PM · May 16, 2026 · 443.3K Views