Epoch AI finds evidence of acceleration on three of four capability metrics, driven by reasoning models like OpenAI o1-preview that now solve previously unresolved math problems
Reasoning models show steeper gains than non-reasoning models on math tasks.
@davidad yeah
@davidad actually you commented this that day on that post. I guess I updated a lot new scaling law, separate of results however good, do you think you sufficiently updated on new curve?
Yeah, this is what Ilya (fore)saw

Its funny how much the whole "strawberry" thing, which turned out to be o1-preview, was dismissed as overhyped at launch when it is clear in retrospect that it was way underhyped. A direct line from models unable to do basic math to solving unresolved math problems in 18 months.
image from: https://epoch.ai/blog/have-ai-capabilities-accelerated
Yeah, this is what Ilya (fore)saw
@nickcammarata Definitely not. I failed to update until I read the DeepSeek-R1-Zero paper