KempeLab researchers analyze internalization, the process of training models to absorb chain-of-thought computations directly into parameters · Digg