this is a banger paper
they estimate no-thinking (no-CoT) time horizons, and they're doubling every 373 days!
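quick sanity check on what a 373-day doubling time means in practice. This is just my own arithmetic, assuming a clean exponential trend (the function name and the per-year framing are mine, not the paper's):

```python
# Doubling time quoted above; everything else here is an illustrative assumption.
DOUBLING_DAYS = 373


def growth_factor(days: float, doubling_days: float = DOUBLING_DAYS) -> float:
    """Multiplicative change in time horizon after `days`, assuming pure exponential growth."""
    return 2 ** (days / doubling_days)


per_year = growth_factor(365)                   # a bit under 2x per calendar year
after_three = growth_factor(3 * DOUBLING_DAYS)  # three doublings

print(f"growth per year: {per_year:.2f}x")
print(f"growth after {3 * DOUBLING_DAYS} days: {after_three:.0f}x")
```

so roughly one doubling per year, i.e. about 8x in a little over three years if the trend holds.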
"doubling the 50% TH requires a 4.2× increase in total parameters, a 2.1× increase in active parameters, a 1.3× increase in the layer count, or a 3.1× increase in pretraining FLOPs"
(no surprise: layer count needs the smallest multiplier, so per multiplicative increase it's the most effective lever for no-thinking time horizons)
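to compare the four levers on one scale, here's a rough sketch that turns each quoted multiplier into an implied power-law exponent. Big assumption on my part: that TH scales as x^alpha in each quantity taken on its own, so alpha = ln 2 / ln(multiplier). The paper may fit something different; only the four multipliers come from the quote.

```python
import math

# Multipliers quoted from the paper: factor needed in each quantity to double the 50% TH.
multipliers = {
    "total parameters": 4.2,
    "active parameters": 2.1,
    "layer count": 1.3,
    "pretraining FLOPs": 3.1,
}


def implied_exponent(multiplier: float) -> float:
    """Assumed model: TH ∝ x**alpha, so multiplier**alpha == 2 and alpha = ln 2 / ln multiplier."""
    return math.log(2) / math.log(multiplier)


exponents = {name: implied_exponent(m) for name, m in multipliers.items()}

# Largest exponent first: the steeper the exponent, the more TH you get per multiplicative increase.
for name, alpha in sorted(exponents.items(), key=lambda kv: -kv[1]):
    print(f"{name}: TH ~ x^{alpha:.2f}")
```

under that assumption layer count comes out with by far the steepest exponent, which matches the reading above.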
"At the slowest doubling within the 95% CI, no-CoT THs still reach almost 10 minutes of latent reasoning by 2030"
New paper!
Think Fast: Estimating No-CoT Task-Completion Time Horizons of Frontier AI Models
@METR_Evals showed that models' time horizons have doubled every few months. We ask: what length of tasks can models complete without any CoT?







