
OpenAI's Daniel A. Roberts says pairing reinforcement learning with LLMs enables AI to make original scientific and mathematical discoveries

The approach uses verifiable rewards to drive scientific exploration.

Original post
Matt Turck (@mattturck)

Why AI Can Now Make Discoveries - my conversation with @danintheory, Lead of the Foundations of Reinforcement Learning team at @OpenAI

00:00 Intro: AI's wild week in mathematics

01:21 What OpenAI's Foundations of RL team does

03:08 Dan's journey: from black holes and quantum gravity to frontier AI

07:04 Are AI systems becoming useful for real science?

08:21 The AI math moment: Erdős, OpenAI, DeepMind, and Anthropic

08:52 Why the OpenAI result was an act of exploration

10:25 OpenAI vs. DeepMind: informal reasoning vs. formal proof

12:13 RL 101: learning by doing, not just watching

15:10 Why reinforcement learning works

15:58 How RL breaks: sparse feedback and long-horizon tasks

17:03 RLHF: how human feedback shaped early language models

18:48 Move 37, self-play, and the search for novel strategies

22:16 Explore vs. exploit in scientific discovery

24:49 Why RL may now be "the cake," not the cherry on top

25:46 Why RL started working with large language models

27:29 Is RL "sucking supervision through a straw"?

28:47 Why language may be the grounding layer for intelligence

31:46 A contrarian take on the Bitter Lesson

32:41 What test-time compute actually is

34:50 How RL gives models the ability to think

35:40 Verifiable rewards, math, coding, and the messy real world

38:00 What physics can teach us about AI

42:08 Is there a thermodynamics of AI?

43:08 From Erdős problems to Einstein-level AI

45:16 Is AI already doing original science?

45:51 How far are we from AI automating AI research?

47:41 Why Dan is excited about the future of science

10:27 AM · Jun 4, 2026 · 14.9K Views
Posts from X
Views 30.5K · Bookmarks 93 · Likes 350 · Retweets 20 · Replies 52
OpenAI (@OpenAI)

What happened when one of our models found a counterexample to an 80-year-old Erdős conjecture?

Researchers @alexwei_, @HongxunWu, and @wjmzbmr1 shared the story on the OpenAI Podcast with @AndrewMayne and explained how mathematicians and models can work together to make new discoveries.

36m ago · 30.5K Views · 350 Likes · 93 Bookmarks