Meta AI's Konstantin Mishchenko asks for evidence of the Muon optimizer's benefits in reinforcement learning, saying his impression so far is that optimizers matter more in pre-training

Developer Joseph Suarez pointed to "Puffing up PPO" on his articles tab

Original post
Konstantin Mishchenko (@konstmish) · #1792 in Tech

@jsuarez @ChenTessler Are there some blogs or papers on that? Would be very interested in cases where Muon is helpful in RL, my impression so far has been that optimizers matter more in pre-training.

@ChenTessler It was a major component of Puffer 3. We reswept hypers for ~10 tasks with Muon vs Adam and the gap was quite clear. Try the PufferNet arch over LSTM next!

7:22 AM · Jun 11, 2026 · 71 Views
Posts from X

@jsuarez @ChenTessler ok I guess I'll just try running some examples with pufferlib myself

@konstmish @ChenTessler Puffing up PPO on my articles tab

2h · Views 11 · Likes 0 · Bookmarks 0