Ethan Mollick says Anthropic's Claude Fable model ran autonomously for nine hours to process a 15-page project design document

Story Overview

Ethan Mollick's early test of Anthropic's Mythos-class model shows it tackling a 15-page design spec with almost no ongoing guidance, running for more than nine hours while spinning up sub-agents, pulling real flight and rail data, writing code, and verifying its own outputs to deliver working software for statistical analysis.

Original post
Ethan Mollick@emollick · #181 in Tech

I've had access to Fable for a bit. A genuine jump in capability, I could feed it a 15 page design document for a project and it would work for 9+ hours and deliver terrific results.

But working with it is weird & weirder is coming

Lots of examples: https://open.substack.com/pub/oneusefulthing/p/what-it-feels-like-to-work-with-mythos?r=i5f7&utm_campaign=post&utm_medium=web&showWelcomeOnShare=true

10:12 AM · Jun 9, 2026 · 552.3K Views
Developer Impact

Delegation patterns look different

Instead of constant back-and-forth, the model handles hundreds of micro-decisions on its own and only surfaces occasional updates, which feels novel for anyone used to steering AI step by step.

Cost Pressure

Real cost depends on how it delegates

At double the per-token rate of earlier Opus models, the nine-hour runs could get expensive unless cheaper sub-agents offset much of the spend, though exact totals from the tests remain unclear.
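
The trade-off above can be made concrete with a rough estimate. The Python sketch below is illustrative only: the function name, the token count, and both rates are hypothetical placeholders, not figures from Mollick's tests. The only value taken from the text is the 2x multiplier over earlier Opus pricing.

```python
def blended_run_cost(total_tokens, base_rate_per_mtok, delegated_share,
                     subagent_rate_per_mtok, premium_multiplier=2.0):
    """Estimate the cost of one long autonomous run.

    total_tokens           -- tokens consumed by the whole run (hypothetical)
    base_rate_per_mtok     -- earlier-model price per million tokens (hypothetical)
    delegated_share        -- fraction of tokens handled by cheaper sub-agents (0..1)
    subagent_rate_per_mtok -- sub-agent price per million tokens (hypothetical)
    premium_multiplier     -- 2.0 reflects "double the per-token rate" from the text
    """
    if not 0.0 <= delegated_share <= 1.0:
        raise ValueError("delegated_share must be between 0 and 1")

    main_rate = base_rate_per_mtok * premium_multiplier
    main_tokens = total_tokens * (1.0 - delegated_share)
    sub_tokens = total_tokens * delegated_share

    # Rates are quoted per million tokens, so divide the total by 1,000,000.
    return (main_tokens * main_rate
            + sub_tokens * subagent_rate_per_mtok) / 1_000_000


# Placeholder inputs: 10M tokens, base rate 10.0, sub-agent rate 1.0.
no_delegation = blended_run_cost(10_000_000, 10.0, 0.0, 1.0)    # 200.0
half_delegated = blended_run_cost(10_000_000, 10.0, 0.5, 1.0)   # 105.0
```

With these placeholder inputs, delegating half of the tokens roughly halves the bill, which is why the real cost hinges on how much work the model hands to sub-agents.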

Sentiment

Many users praised Fable for completing complex projects in nine-hour runs and singled out the article's snake game example, while others dismissed the claims as uninnovative or questioned safety priorities.

Positive: 73.3% · Negative: 26.7% (15 comments with sentiment)
Cluster Engagement
Posts from X
Most Activity
Views 40.5K · Bookmarks 108 · Likes 174 · Retweets 8 · Replies 14
Ethan Mollick@emollick

Some fun examples, I just gave basic prompts and the AI executed:

Balatro, but for coin flipping (all the design and ideas was Fable): https://play-flipside.netlify.app/

The best self-aware snake game: https://snake-stable-build.netlify.app/

An isochronic map using real data: https://isochronic-passage-chart.netlify.app/#syd

1d · Views 40.5K · Likes 174 · Bookmarks 108
Alex Albert@alexalbert__

@emollick Thanks for testing it!

1d · Views 8.1K · Likes 52 · Bookmarks 2
Ethan Mollick@emollick

@arthur_spirling I generally get access to Big Labs models early. Some combination of being a researcher & being influential. I have never taken any money from any AI lab and they don’t see my posts in advance.

1d · Views 4.9K · Likes 40 · Bookmarks 2
Kavin Stewart@kavinstewart

@emollick Can you share the prompts you used for the games? The snake one (http://snake-stable-build.netlify.app) is way more creative than anything I've seen from other models, so I'm curious how much of that was you vs the model

1d · Views 2.5K · Likes 3 · Bookmarks 3
Ethan Mollick@emollick

@arthur_spirling Nope. I don’t write about the vast majority of models I test. And no one has ever asked if I am going to do so.

1d · Views 969 · Likes 8
Arthur Spirling@arthur_spirling

@emollick Sorry if I missed it: could you say more about how you had early access to Fable?

1d · Views 5.6K · Likes 4
gikiewicz@GrzGik

@emollick 9 hours unsupervised and "terrific results" - that's the dream pitch. But "weird & weirder" is doing a lot of heavy lifting for a cliffhanger. What actually spooked you - the process or the output?

1d · Views 1.6K · Bookmarks 1
carstenbergenholtz@justsomeoneDK

@emollick In case anyone is interested, had ChatGPT 5.5 Pro review the paper https://chatgpt.com/s/t_6a284f7783b4819191f96c0c5c5d638e. Briefly put: A solid paper, and the "flaws" are not major errors but elements that are typically modified during a review process. Note, I haven't read the paper fully myself.

1d · Views 182 · Likes 1 · Bookmarks 1
Arthur Spirling@arthur_spirling

@emollick Thank you. May I ask if the usual expectation is that you will write a post about the product?

1d · Views 988 · Likes 2
Quiveron@quiveron_x

@emollick Does it talk like Opus 4.8?

1d · Views 1.6K
Darko Mulej@DarkoMulej

Definitely nothing to worry about.

This is only about an exponential improvement trajectory and RSI (recursive self-improvement – https://www.anthropic.com/institute/recursive-self-improvement).

What can go wrong?

I commented on the RSI article that there were clear, if implicit, warnings about dangerous capabilities, but today's news is actually a new level. https://darkomulej.substack.com/p/when-ai-builds-itself-a-warning-disguised

1d · Views 1.9K · Likes 3
Jessie@jessie_thinker

@emollick It was hard not to feel something otherworldly while reading this & clicking thru images/examples. "Product release" no longer feels like the appropriate category.

1d · Views 676 · Likes 3
Kid Astronaut@Kidastronaut_

@emollick Damn. I am not using it right. I had it format an Excel file for me and it still struggled.

Wanted it to match the exact format of a file I uploaded as an example (image) and it still struggled.

19h · Views 166
Sarah Lakzit@SarahLakzit

@emollick the security cost hides in "9+ hours." approval used to be per action. a model that runs unattended that long collapses it to one decision at hour zero. you're not approving a task anymore, you're approving nine hours of judgment you never see.

1d · Views 1.4K · Likes 2
Paweł J Lisowski@PawelJLisowski

@emollick ye similar experience so far, it's a very careful model and verifies its own work a lot better than the previous model. it's also very slow, at least so far today

1d · Views 1.2K · Likes 2
Kraggi@Kraggich

@emollick 9 hours means the model now has a longer attention span than I do. you can't watch a run that long, so you need something between babysitting it and finding out at dinner what it decided to build.

1d · Views 1.1K · Likes 2
Being Co@Being__Co

@emollick Thanks for sharing. Great to get a real world view of what’s possible. The marginal cost of software deployment just continues to plummet. Love the test ideas you used to put it through its paces

1d · Views 3.5K · Likes 1
Nick Macedo@nick_macedo

@bribiotech @emollick Have you tried it in cowork? It's done amazingly well for PPT documents in cowork for me. Clear and design forward.

1d · Views 21 · Likes 1