The first set of RL experiments on PufferLib 5.0 (dev) is running now. It may take some time to refine, but I'm confident we have the core algorithmic change roughly correct. Only ~500 lines of CUDA C!
5:11 PM · Jun 15, 2026 · 3.5K Views
The updates require operators to rerun prior experimental sweeps
The first set of RL experiments on PufferLib 5.0 (dev) is running now. It may take some time to refine, but I'm confident we have the core algorithmic change roughly correct. Only ~500 lines of CUDA C!
Users are disappointed with PufferLib 5.0 because its core CUDA changes force them to rerun existing RL experiment sweeps.
@jsuarez @DanAdvantage Noooooo I gotta rerun my sweeps 😭
The first set of RL experiments on PufferLib 5.0 (dev) is running now. It may take some time to refine, but I'm confident we have the core algorithmic change roughly correct. Only ~500 lines of CUDA C!