I launched 3 more videos in my post-training course! 1. Lecture 5: The rise of reasoning models 2. Lecture 6: DPO derivation, intuitions, and practice 3. A Q&A from readers on lectures 1-4
rlhfbook dot com slash course More soon!
I launched 3 more videos in my post-training course! 1. Lecture 5: The rise of reasoning models 2. Lecture 6: DPO derivation, intuitions, and practice 3. A Q&A from readers on lectures 1-4
rlhfbook dot com slash course More soon!
Users are praising Nathan Lambert for releasing new lectures on reasoning models and DPO because they see the shared material as wholesome and valuable.
Course page: https://rlhfbook.com/course
YT Playlist: https://www.youtube.com/watch?v=x-MqKBzoxkI&list=PLL1tdVxB1CpVpEtMHxwuR4uI4Lxjw00_y&index=7
I launched 3 more videos in my post-training course! 1. Lecture 5: The rise of reasoning models 2. Lecture 6: DPO derivation, intuitions, and practice 3. A Q&A from readers on lectures 1-4
rlhfbook dot com slash course More soon!
I also made a page with extra resources (talks and books I recommend): https://rlhfbook.com/course#extra-resources
Course page: https://rlhfbook.com/course
YT Playlist: https://www.youtube.com/watch?v=x-MqKBzoxkI&list=PLL1tdVxB1CpVpEtMHxwuR4uI4Lxjw00_y&index=7

@natolambert Thanks for sharing!:)

@natolambert that's wholesome, love it! keep it up

@natolambert Massive 👏

@natolambert Bravo 👏