Microsoft Research AI Frontiers' Dimitris Papailiopoulos says optimized AdamW variants can match Muon's validation loss per step · Digg