if you're doing RL on agent use cases, check out this video.
agents might seem like the most obvious application of post-training right now, but there are a number of fundamental challenges that the open source community is still working through.
this video walks through those challenges at a meandering and educational pace.
https://www.youtube.com/watch?v=cixmqTsi2A4