Check out our new paper on internalization: the process of gradually "absorbing" chain of thought computations during training. Our results show that internalization can work for problems that are computationally hard to learn directly. We carefully study method and task specific factors that determine internalization success. To learn more, see https://arxiv.org/abs/2606.20937. With @nikostsilivis Nirmit Joshi @_rkomma Nati Srebro. @NYUDataScience @TTIC_Connect
KempeLab researchers analyze internalization, the process of training models to absorb chain-of-thought computations directly into parameters
Story Overview
KempeLab researchers map out how transformers first master explicit chain-of-thought steps on tough tasks and then fold those steps straight into their weights, skipping token-by-token generation at inference time.
Hard problems reward the two-stage route
On sparse parity, a task that resists direct learning, the models reliably pick up the solution only after explicit CoT supervision followed by gradual token removal, giving the first rigorous proof that internalization can unlock otherwise intractable computations.
Trade-offs still need mapping
Wider models internalize more readily than deeper ones on semiautomata tasks, yet the resulting shortcuts often hurt performance on out-of-distribution cases, leaving open how far the speed gains extend beyond the studied settings.
No Digg Deeper questions have been answered for this story yet.
Most Activity
Check out our new paper on internalization: the process of gradually "absorbing" chain of thought computations during training. Our results show that internalization can work for problems that are computationally hard to learn directly. We carefully study method and task specific factors that determine internalization success. To learn more, see https://arxiv.org/abs/2606.20937. With @nikostsilivis @nirmitj_ @_rkomma Nati Srebro. @NYUDataScience @TTIC_Connect