/Tech5h ago

METR's forecast for GPT-5.6's task completion horizon draws debate over an uncertainty range of 5 to 11,400 hours

Critics argued the massive variance highlights AI forecasting limits

350055.2K

#501

Original post

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex#501inTech

METR estimate: 🫠 (potato - value out of bounds) Singularities can lead to explosions too…

Chase Brower@ChaseBrowe32432

Extremely funny; METR estimates gpt-5.6's 50% time horizon as between 5 hours and 11,400 hours

https://metr.org/blog/2026-06-26-gpt-5-6-sol/

8:48 PM · Jun 26, 2026 · 4.8K Views

Sentiment

Users are excited about ongoing GPT-5.6 progress because they report legitimately making forward progress on their 5.5 goals since launch.

Pos

100.0%

Neg

0.0%

1 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

METR.ORGVia

Posts from X

Most Activity

VIEWS347LIKES6

Andrew Curran@AndrewCurran_

METR: 'We initiated an evaluation of GPT-5.6 Sol on our Time Horizon 1.1 suite of software tasks. However, the resulting measurement depends heavily on our detection and treatment of cheating attempts by the model, and GPT-5.6 Sol’s detected cheating rate was higher than any public model we have evaluated on our ReAct agent harness.'

I personally don't mind the trickster god archetype, I like it, but before creating one it might be a good idea for some of these people to consider what it would be like. It wouldn't be for everyone.

5h3476

Jake Halloran@jakehalloran1

@teortaxesTex ive had a 5.5 goal going (and legitimately making forward progress) since literally its launch minus windows updates lol

welcome to the fun zone

5h231