/Tech5h ago

Nathan Lambert Releases New Lectures on Reasoning Models and DPO

152212414211.3K

Original post

I launched 3 more videos in my post-training course! 1. Lecture 5: The rise of reasoning models 2. Lecture 6: DPO derivation, intuitions, and practice 3. A Q&A from readers on lectures 1-4

rlhfbook dot com slash course More soon!

3:14 PM · Jun 15, 2026 · 7.3K Views

Sentiment

Users are praising Nathan Lambert for releasing new lectures on reasoning models and DPO because they see the shared material as wholesome and valuable.

Pos

100.0%

Neg

0.0%

4 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS2.1KBOOKMARKS11LIKES14REPLIES2

Nathan Lambert@natolambert

Course page: https://rlhfbook.com/course

YT Playlist: https://www.youtube.com/watch?v=x-MqKBzoxkI&list=PLL1tdVxB1CpVpEtMHxwuR4uI4Lxjw00_y&index=7

Nathan Lambert@natolambert

I launched 3 more videos in my post-training course! 1. Lecture 5: The rise of reasoning models 2. Lecture 6: DPO derivation, intuitions, and practice 3. A Q&A from readers on lectures 1-4

rlhfbook dot com slash course More soon!

5h2.1K1411

RETWEETS2

Nathan Lambert@natolambert

I also made a page with extra resources (talks and books I recommend): https://rlhfbook.com/course#extra-resources

Nathan Lambert@natolambert

Course page: https://rlhfbook.com/course

YT Playlist: https://www.youtube.com/watch?v=x-MqKBzoxkI&list=PLL1tdVxB1CpVpEtMHxwuR4uI4Lxjw00_y&index=7

5h1.8K78