/AI4h ago

Ethan Mollick says Anthropic's Claude Fable model ran autonomously for nine hours to process a 15-page project design document

Story Overview

Ethan Mollick's early test of Anthropic's Mythos-class model shows it tackling a roughly 19-page design spec with almost no ongoing guidance, running for 9.5 hours while spinning up sub-agents, pulling real flight and rail data, writing code, and verifying its own outputs to deliver working software for statistical analysis.

1142.1K2101.5K275K

#176

Original post

Ethan Mollick@emollick#176inAI

I've had access to Fable for a bit. A genuine jump in capability, I could feed it a 15 page design document for a project and it would work for 9+ hours and deliver terrific results.

But working with it is weird & weirder is coming

Lots of examples: https://open.substack.com/pub/oneusefulthing/p/what-it-feels-like-to-work-with-mythos?r=i5f7&utm_campaign=post&utm_medium=web&showWelcomeOnShare=true

10:12 AM · Jun 9, 2026 · 277.9K Views

/AI4h ago

Ethan Mollick says Anthropic's Claude Fable model ran autonomously for nine hours to process a 15-page project design document

Story Overview

1142.1K2101.5K275K

#176

Original post

Ethan Mollick@emollick#176inAI

I've had access to Fable for a bit. A genuine jump in capability, I could feed it a 15 page design document for a project and it would work for 9+ hours and deliver terrific results.

But working with it is weird & weirder is coming

Lots of examples: https://open.substack.com/pub/oneusefulthing/p/what-it-feels-like-to-work-with-mythos?r=i5f7&utm_campaign=post&utm_medium=web&showWelcomeOnShare=true

10:12 AM · Jun 9, 2026 · 277.9K Views

Developer Impact

Delegation patterns look different

Instead of constant back-and-forth, the model handles hundreds of micro-decisions on its own and only surfaces occasional updates, which feels novel for anyone used to steering AI step by step.

Cost Pressure

Real cost depends on how it delegates

At double the per-token rate of earlier Opus models, the nine-hour runs could get expensive unless cheaper sub-agents offset much of the spend, though exact totals from the tests remain unclear.

Sentiment

Many users praised Fable AI's nine-hour project results and snake game demo as impressive evidence of real-world AI progress and AGI potential.

Pos

92.3%

Neg

7.7%

13 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS6.2KLIKES45REPLIES1

Alex Albert@alexalbert__

@emollick Thanks for testing it!

Ethan Mollick@emollick

I've had access to Fable for a bit. A genuine jump in capability, I could feed it a 15 page design document for a project and it would work for 9+ hours and deliver terrific results.

But working with it is weird & weirder is coming

Lots of examples: https://open.substack.com/pub/oneusefulthing/p/what-it-feels-like-to-work-with-mythos?r=i5f7&utm_campaign=post&utm_medium=web&showWelcomeOnShare=true

3h6.2K452

BOOKMARKS6RETWEETS2

Ethan Mollick@emollick

Some fun examples, I just gave basic prompts and the AI executed: Balatro, but for coin flipping (all the design and ideas was Fable): https://play-flipside.netlify.app/ The best self-aware snake game: https://snake-stable-build.netlify.app/ An isochronic map using real data: https://isochronic-passage-chart.netlify.app/#syd

3h2.9K96

Kavin Stewart@kavinstewart

@emollick Can you share the prompts you used for the games? The snake one (http://snake-stable-build.netlify.app) is way more creative than anything I've seen from other models, so I'm curious how much of that was you vs the model

2h2.5K33

carstenbergenholtz@justsomeoneDK

@emollick In case anyone is interested, had ChatGPT 5.5 Pro review the paper https://chatgpt.com/s/t_6a284f7783b4819191f96c0c5c5d638e. Briefly put: A solid paper, and the "flaws" are not major errors but elements that are typically modified during a review process. Note, I haven't read the paper fully myself.

3h18211

Quiveron@quiveron_x

@emollick Does it talk like Opus 4.8 ?

3h1.6K

Sarah Lakzit@SarahLakzit

@emollick the security cost hides in "9+ hours." approval used to be per action. a model that runs unattended that long collapses it to one decision at hour zero. you're not approving a task anymore, you're approving nine hours of judgment you never see.

2h1.4K2

Paweł J Lisowski@PawelJLisowski

@emollick ye similiar experience so far, its very careful model and verifies its own work a lot better than pervious model. its also very slow at least so far today

3h1.2K2

GeniusPothead 💹🧲@GeniusPothead

@emollick

3h1.1K2

Nick Macedo@nick_macedo

@bribiotech @emollick Have you tried it in cowork? It's done amazingly well for PPT documents in cowork for me. Clear and design forward.

3h211

Teja Padala | AI & Product@tejapadala93

@emollick the 9-hour run is wild but the thing that stuck with me is the bottleneck just moved onto the doc you hand it. fed a sloppy spec once and got back beautifully executed nonsense, fast. writing the spec is quietly becoming the whole job.

2h1.9K1

Alexander Benz@alexanderbenz

@emollick 9+ hours on a 15-page doc is exactly what we see with Claude Code on Mato. It runs unsupervised and makes judgment calls you didn't ask for. Some are right. Some aren't. What category of weirdness showed up first for you?

3h1.8K1

Quentin André@andre_quentin

@emollick Oh my god the self-aware snake game is... genuinely funny and surprising?!

3h5062

Koby Ofek@kobyof

@emollick Ethan, stop everything you’re doing, drop every engagement you have, and open a game company with just this snake game as the product. It’s awesome!

2h1842

Eth@EtherCoins

@emollick Checked your article, most of it at least.

Still did not find anything surprising or innovative, have you tried to test it in what apparently is its niche? (Security and attack vectors eval).

3h5821

Mike Bradley@The_Only_Signal

@emollick Sounds like GPT-Codex-5.5

3h5671

Virgil Maro@_virgil19

@emollick nine hours is past the point where it feels like your project. that's the weird part.

3h5471

BriBiotech@bribiotech

@emollick Nice I don't get is why Anthropic can't sort out presentations. Claude in PowerPoint produced slides read like someone had a case verbal diarrhea, theres is no balance all ideas are dumped into the pack

3h1.5K

Element Dong@elementdsj

@emollick 15 pages in, 9 hours of autonomous work, terrific results. the part that's hard to adjust to isn't the output. it's not being present for any of the decisions.

3h4861

Radhakishan Jat@rkjat65

@emollick Application on distance calculation using Isochromic map was really impressive

3h1.4K

David Valerio Gilmore@davidvgilmore

@emollick I've wondered how Fable would approach subtasks. From your write-up, it's clear it continues the trend of smarter models producing better outcomes by better managing subagents.

Frontier models managing small open models (via model routing) is the path to abundant intelligence

3h1.4K