NanoGPT Speedruns Show Cumulative Gains From Shallow Optimizations · Digg