DHH says GPT-5.5 shows substantial gains on complicated agent work, reversing version 5.2's lag behind Opus and marking a recovery for OpenAI amid rising competition

Views: 199.1K | Bookmarks: 91 | Likes: 1.6K | Retweets: 49 | Replies: 164

GPT-5.5 is a very good model

For complicated agent work, it's amazing how much GPT5.5 has improved. I found 5.2 to be very far behind Opus. Now using Opus 4.7 after 5.5 feels like a big step backwards. Gotta love this level of competition! Strong comeback for OpenAI.

kache@yacineMTB

Fact check: true

DHH@dhh

The Omarchy 4 branch is now 30,000 lines of new code. The majority of it was written by GPT5.5. It's been so, so good at QML. You still need to review, but there's just no way this scale of a conversion would be feasible without AI in a reasonable time. https://github.com/basecamp/omarchy/pull/5856

Vaibhav (VB) Srivastav@reach_vb

GPT-5.5 cranking out 30k lines of QML for the Omarchy 4 branch + nailing subtle agentic reasoning!!

jason@jxnlco

thnx

DHH@dhh

But what impresses me just as much is how good it is at explaining MY OWN CODE to me when working on Basecamp! Especially delicate JavaScript interactions with lots of subtle nuances. Real glimpses of AGI there.

kache@yacineMTB

It's amazing how many people I end up having similar opinions to simply by being in the arena with an open mind

Erdal@ErdalToprak

@dhh You can also increase the max depth and max threads to have more sub-agents, and define them by reasoning effort. Codex is very cool.

찡긋@Alignment100

@gdb OpenAI Korea B2C "ME"

John Zetterman@jzetterman

@dhh Is 4.0 on Dev or Edge yet?

DHH@dhh

@jzetterman Neither. Still in wild flux. Will probably come to dev in a few weeks.

Eric S. Raymond@esrtweet

@dhh Agreed. 5.5 is a noticeable advance over anything previous.

Kevin@kevincodex

@dhh yes

Steve Gaudio@SteveGaudio

@dhh I don’t know where it goes from here but if 5.5 codex is the last ai model to ever be released I’d be fine with that, it’s amazing.

Joshua STW@Joshua_stw1

@gdb @gdb can you call the next model Goblin?

Shantun Singh Parmar@ParmarShantun

@dhh opus 4.7 doesn't exist yet, maybe double check which models you're actually comparing

Olivier Bonnaure 🥑 ⚡🇫🇷🇬🇷@olivierb

@dhh I need to test GPT5.5 on my Ruby-like language made with Rust. Right now using DeepSeek / Opus 4.7 & MiniMax ...

Lord Follicle's whomsoever machine@wotyagerrin

@theramblingfool @dhh Ohh maybe I need to use the codex app instead of OpenCode with GPT 5.5 and the web based thing.

Manoj@mbajaj_

30,000 lines of agent-generated QML in one branch. the model quality debate is interesting but the real story here is that the review bottleneck is now the only bottleneck. at what point does "you still need to review" become physically impossible at this scale? nobody is reviewing 30,000 lines meaningfully.

TRENDS@TrendsDotGlobal

@gdb this...
