Speaking at @aiDotEngineer today about long-horizon RL and the different tradeoffs involved. 2:25pm.
Also here in SF for a week if anyone wants to catch up 👋
Excited to be speaking at @aiDotEngineer World’s Fair alongside partner in crime @ChengxiTaylor.
It seems we have a lot to talk about:
🤼‍♂️ Actor-critic is hot again? How are current RL algorithms changing with long-horizon rollouts?
🌍 What does a good long-horizon environment look like? How do they differ from terminal-based tasks?
🤖 What does good RL infra look like in 2026? Why is long-horizon breaking things?
As a bonus, I’ll give a rundown of our own RL journey starting from Galactica (2021/2022), early SoTA reasoning efforts at Meta (2023), and more.
Super fun, high signal-to-noise guaranteed. Come join us!
