/Tech4h ago

AI system Fable generates 30-page partial proof of unsolved math problem, succeeding where ChatGPT 5.5 failed

The results, developed with postdoc István Vona, await verification

27744920965K

#50

Original post

Balázs Pozsgay@pozsgaybalazs

Fable achieved a significant breakthrough in one of our open problems. This is a problem where ChatGPT 5.5 could not even begin anything useful. The breakthrough seems legit (although not 100% checked yet), and Fable even claims to have a full solution. >10 hours total runtime so far. A 30 page document with the proofs of some lemmas not yet spelled out. We can not yet know whether Fable indeed has solved it, but even if it is just a partial solution, we are absolutely amazed. More details will follow, and once we are at the end of the story, I will also write a full substack post. Collaboration with István Vona, a postdoc in my group.

1:06 PM · Jun 12, 2026 · 63.8K Views

Sentiment

Positive users highlight Fable's extended runtime as a breakthrough unlocking open math problems beyond GPT-5.5, while negative users dismiss the claims as clickbait slop or hype.

Pos

64.3%

Neg

35.7%

8 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS3.4KBOOKMARKS5LIKES23REPLIES1

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

Dario may be very well calibrated with his "millions of chips is all that matters to beat Chyna" posture. Yeah it'll often be less efficient. Does that truly matter for tasks you can't solve by paying *any* amount of money, because there is no more qualified labor on the market?

Balázs Pozsgay@pozsgaybalazs

4h3.4K235

RETWEETS2

Ilman Shazhaev@shzhv13

@pozsgaybalazs Raw runtime is the new benchmark. Moving to a 10 hour continuous reasoning path changes everything. Even with holes, 30 pages of lemmas is audit ready.

3h3511

Chew Kok Wah@chewkokwah

@pozsgaybalazs By GPT 5.5 do you mean GPT 5.5 Pro or the regular GPT 5.5 xHigh ?

5h1.3K6

7rtp@fredyfredo123

@pozsgaybalazs mythos is misaligned so no paper is accepted.

Give us the @leanprover project

5h1.2K5

none your kind@based_buffalo69

@pozsgaybalazs I hear you got some slop from the slop farm?

4h585

Drew Wiberg@drewjwiberg

@pozsgaybalazs @roydanroy Why frame it like that? The language around the alternative charges this where it should be objective.

3h8352

Finna@AndilesAnthony

@pozsgaybalazs Please post a screenshot of an actual response from Fable 5. Any random response.

3h54

Tuxsoia@Tuxsoia

@pozsgaybalazs How do you work with it? Put something like /goal in claude code and ask it to do a paper, solve a problem? How can you make it think for so long otherwise?

5h1.1K

🇸🇴 🕳️ Stochastic Dreams@GenHeres123

Can we stop with this argument? The engineers and researchers at Anthropic are elite and they're absolutely on par with the researchers at OpenAI. This was never about chips or compute. If it were, Google which has the best chips, the largest compute infrastructure, and the best data on the planet wouldn't be struggling. Same applies to xAI.

3h802

Finna@AndilesAnthony

@pozsgaybalazs Did it actually spit out an answer or just give you its CoT?

4h672

alias@loadingalias

@pozsgaybalazs @roydanroy Hey, please keep us updated?

2h245

Gerard Sans | Axiom 🇬🇧@gerardsans

@pozsgaybalazs Be aware it’s not a model anymore but an agentic system. You can’t compare it to a regular model. You would need a harness.

4h201

rohan@rohanganapa

@pozsgaybalazs terao’s?

3h183

Dr. Bobby Gomez-Reino@BobbyGRG

@chewkokwah @pozsgaybalazs isnt gpt5pro is a cot/tot harness on top of 5.5 mostly? i hope if they r in a hurry to solve math problems they are not just trying with a thin api harness but have created one. or tbh just use Cursor

4h110

glizzop@dogeglizzy

@pozsgaybalazs Fable deez nuts nigga

2h78

Ripping Whale@zacurate

@pozsgaybalazs Prove it. Show the data. This post evidences nothing besides clickbait. Prove. It.

1h41

Sebastian Buzdugan@sebuzdugan

@pozsgaybalazs what's the verifier here because 10 hours of search can manufacture polished false positives

1h26

Cork King@0korkle0

@based_buffalo69 Someone has a chip on their shoulder. I can smell the resentment through the screen. You know deep down this isnt an example of that in this case, yet you feel so threatened by it that you have to denigrate it without knowing anything about this particular case.

4h16

Finna@AndilesAnthony

@pozsgaybalazs Guys they listened to me. It works. Thanks Dario.

1h3

Kekko D’Amato@kekkodamato_

The >10h runtime detail is what stands out most. We've been so fixated on fast inference that we've barely explored what sustained computation unlocks. If even partial progress on an open problem is achievable at this scale, the real bottleneck shifts to verification — mathematicians reviewing AI-generated proofs. What's the domain?

3h3