/Tech1h ago

Shopify CTO Mikhail Parakhin pairs Fable 5 for coding with GPT-5.6 Max for running experiments

His evaluation found GPT-5.6 Max outperformed Opus 4.8 overall.

2630677922.5K

#501

Original post

Mikhail Parakhin@MParakhin#1103inTech

Not as relevant now :-(: I had an opportunity to deeply test both Fable 5 and GPT-5.6 Max. 5.6 is clearly better than Opus 4.8 at everything (slightly faster, too, though that depends on the load). Vis-a-vie Fable, it is clearly worse on coding, but better on agentic workloads. I had Fable write code, 5.6 run experiments - dreamy…

9:35 PM · Jun 26, 2026 · 23.6K Views

Sentiment

Users praise GPT-5.6 Max for outperforming Opus 4.8 and Fable 5 on agentic and coding tasks due to its execution quality and OpenAI's scaling advantages.

Pos

100.0%

Neg

0.0%

6 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS1.9KREPLIES2

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

I trust Mikhail

Mikhail Parakhin@MParakhin

50m1.9K152

BOOKMARKS4

Mikhail Parakhin@MParakhin

@veeransg5 No, ultra = workflows on high, the trick is to use max and then say: "Please start workflows with multiple agents..." - agents inherit max effort, makes a big difference.

26m77974

LIKES17

Mikhail Parakhin@MParakhin

@AlanRBlair I did. On agentic non-coding (taking actions) I found 5.6 clearly better. On discussing history/math Fable has an edge.

1h1.1K172

Mikhail Parakhin@MParakhin

@Khalin_George Oh yeah, endless critique loop. Fable is very good, so, maybe only 3 iterations. 5.6 makes far fewer bugs than 4.8, but is no Fable - so, 7-8, even 9 iterations "review changes, find bugs - fix - review changes, find bugs - fix, ..."

14m54612

Mikhail Parakhin@MParakhin

@SebastianSzturo Testing a preview. Don't have access to either anymore :-(

25m78451

Alan Blair@AlanRBlair

@MParakhin Amazing, thank you. Good to know someone can snip at Fable's heels.

The 2 days I used Fable; it did have a bit of that Big Model Smell - same vibes with 5.6? Or just an improvement in 5.5?

1h3831

Alan Blair@AlanRBlair

@MParakhin Tell us more! No one else has tested both and spoken about it!

Any chance you used both for non-coding applications?

1h1.1K4

veerana gowda@veeransg5

@MParakhin Ultra and the max reasoning mode is the same?

34m1.1K2

Georgy Khalin@Khalin_George

@MParakhin What kind of setup do you have for running experiments? Does it involve extra agents for verification?

18m5251

Bright Mirror@_brightmirror

@MParakhin Fable or Sol Max- which felt like the most intelligent?

23m1893

Mikhail Parakhin@MParakhin

@manabiSRS Ultra is just multiple agents, but each lower reasoning effort - high. You can start them from Max - then they inherit Max, makes a difference.

24m3562

Aiden@VibeCodeAiden

@MParakhin God damn Fable on the coding and deliberation while 5.6 executes perfectly? THE Setup. I wish...

36m6051

manabi.io@manabiSRS

@MParakhin Did you try 5.6 Sol Ultra?

59m5531

Sebastian Szturo@SebastianSzturo

@MParakhin Now tell us how in the world you get access to both!?

50m3641

Strata@ChainZenit

@MParakhin that sounds like a wild testing session, how do you manage it?

1h597

Uncle J@UncleJAI

@MParakhin I think this split is the interesting part.

A model can be worse at raw code and still better at delegated work if it plans, recovers, and uses tools with less babysitting.

1h578

Ferbin@Ferbin08

@MParakhin Code mistakes you fix. Agent mistakes cascade. That's why it matters.

1h563

appcaster@Himomohi

@MParakhin 쇼피파이 로 이직하면 가능합니까 ?

1h411

Gregor@bygregorr

@MParakhin what were the agentic tasks specifically? not sure how something lags on coding but leads on agentic if most agentic work is just multi-step codegen under the hood

36m353

Rush hour Notes@RushHourNotes

@MParakhin マルチエージェント運用が最適解ってことか。単体の性能議論より、どう組み合わせるかのインフラ設計が重要になりそう。

36m262