
Reinforcement learning pioneer Richard Sutton argues that supervised LLMs cannot achieve original discovery because they only mimic training inputs

Academic Yu Su argues LLMs can build discovery evaluators.


Original post

The question is about discovery. If an LLM (trained purely by supervised learning to mimic its input) is fed talk of discovery, then it will talk more about discovery, but it won't do discovery. Perhaps we can agree on the answer to that limited question? Anyway, I hope we are getting closer to the question. It requires a little bit of nuance. Markus Buehler and I were careful not to claim a limitation of LLMs in general, as they are not even a well-defined concept.

12:45 PM · Jun 7, 2026 · 96 Views

Richard Sutton (@RichardSSutton)

