OpenAI's Daniel A. Roberts says pairing reinforcement learning with LLMs enables AI to make original scientific and mathematical discoveries

The approach uses verifiable rewards to drive scientific exploration.

Original post

Matt Turck (@mattturck) · #1497 in AI

Why AI Can Now Make Discoveries - my conversation with @danintheory, Lead of the Foundations of Reinforcement Learning team at @OpenAI

00:00 Intro: AI's wild week in mathematics

01:21 What OpenAI's Foundations of RL team does

03:08 Dan's journey: from black holes and quantum gravity to frontier AI

07:04 Are AI systems becoming useful for real science?

08:21 The AI math moment: Erdős, OpenAI, DeepMind, and Anthropic

08:52 Why the OpenAI result was an act of exploration

10:25 OpenAI vs. DeepMind: informal reasoning vs. formal proof

12:13 RL 101: learning by doing, not just watching

15:10 Why reinforcement learning works

15:58 How RL breaks: sparse feedback and long-horizon tasks

17:03 RLHF: how human feedback shaped early language models

18:48 Move 37, self-play, and the search for novel strategies

22:16 Explore vs. exploit in scientific discovery

24:49 Why RL may now be "the cake," not the cherry on top

25:46 Why RL started working with large language models

27:29 Is RL "sucking supervision through a straw"?

28:47 Why language may be the grounding layer for intelligence

31:46 A contrarian take on the Bitter Lesson

32:41 What test-time compute actually is

34:50 How RL gives models the ability to think

35:40 Verifiable rewards, math, coding, and the messy real world

38:00 What physics can teach us about AI

42:08 Is there a thermodynamics of AI?

43:08 From Erdős problems to Einstein-level AI

45:16 Is AI already doing original science?

45:51 How far are we from AI automating AI research?

47:41 Why Dan is excited about the future of science

10:27 AM · Jun 4, 2026 · 14.9K Views

Sentiment

Many users were excited that OpenAI's model found a counterexample to an 80-year-old Erdős conjecture, seeing it as evidence that AI can act as a research partner in mathematics. Some criticized the company for prioritizing hype.

Positive: 89.7% · Negative: 10.3%

37 comments with sentiment.

Posts from X

Most Activity

Views: 30.5K · Bookmarks: 93 · Likes: 350 · Retweets: 20 · Replies: 52

OpenAI (@OpenAI)

What happened when one of our models found a counterexample to an 80-year-old Erdős conjecture?

Researchers @alexwei_, @HongxunWu, and @wjmzbmr1 shared the story on the OpenAI Podcast with @AndrewMayne and explained how mathematicians and models can work together to make new discoveries.

