Gwern argues Lean-trained LLMs will have superior scaling exponents, eventually justifying large-scale software rewrites · Digg