3h ago

Meta Introduces ScheduleFree+ For Learning-Rate-Free LLM Training

โ€”โ€”0โ€”โ€”
Original post

After a few hours of working on this, I have been unable to get it to work in PufferLib for RL. It is far less stable and far more brittle than simple cosine decay, even across different timestep budgets

Joseph Suarez ๐ŸกJoseph Suarez ๐Ÿก@jsuarez

Sold. This guy rocks

7:47 PM ยท May 22, 2026 ยท 10K Views
9:48 PM ยท May 22, 2026 ยท 293 Views