4h ago
Min Li and Haoxiang Wang used Parallax and the SOAP-H optimizer to set a new modded-nanogpt benchmark record of 2,880 steps. The milestone was achieved without any hyperparameter tuning.