Zhehang Du's Newton-Muon optimizer hits the Modded-NanoGPT target in 3,275 steps, beating baseline Muon
It trails NorMuon and the 3,125-step benchmark record.
——1——
QUOTE POST
#424Keller Jordan@KELLERJORDAN0
Newton-Muon
7:20 PM · May 26, 2026 · 1.4K Views
Modded-NanoGPT optimization result #18: @zhanpeng_zhou has achieved a step count of 3225 with a preconditioned Muon variant called PMuon. This non-SOTA result is notable because it doesn't use the other SOTA-track techniques like update clamping and contra-muon.
4:21 AM · May 27, 2026 · 2K Views
QUOTE POST
#424Keller Jordan@KELLERJORDAN0
.@zhanpeng_zhou's post on this result:
4:22 AM · May 27, 2026 · 1.5K Views

