N.B. p-values are as follows: P(invalid) = 2.3e-5 P(non-improvement) = 0.168
New Modded-NanoGPT optimization SOTA: @tensorqt has achieved a 2925-step run (-5 steps vs. prev SOTA) by adding a late parameter-space extrapolation step to the previous record.