Meta Introduces ScheduleFree+ For Learning-Rate-Free LLM Training
After a few hours of working on this, I have been unable to get it to work in PufferLib for RL. It is far less stable and far more brittle than simple cosine decay, even across different timestep budgets.
Sold. This guy rocks
7:47 PM · May 22, 2026 · 10K Views
9:48 PM · May 22, 2026 · 293 Views