
@nrehiew_ says Opus 4.7 and GPT 5.5 effective costs converge because developers reject more GPT 5.5 code

Mean agent request costs vary 9x across evaluated model families.



Original post

Sarah Wang#1354

Cursor @cursor_ai

Introducing the Cursor Developer Habits Report.

We’re sharing some of our findings on how software development is changing.

It’s based on the most comprehensive dataset on AI coding in the world, across all model families.

8:47 AM · May 28, 2026 · 157.8K Views

Sentiment

Many users welcome Cursor's report on cost variation across AI coding models as a useful reminder to default to cheaper options and use them more strategically, while others call the 9x price gap insane and argue that premium models like Opus are not worth the extra cost.

Positive 62.5% · Negative 37.5%

6 comments with sentiment.


Posts from X

Most Activity

Views 24.9K · Bookmarks 54 · Likes 244 · Retweets 15 · Replies 14

eric zakariasson@ericzakariasson

the economics of intelligence

> Cost per agent request varies by nearly 9x across model families, showing that the same workflow can have very different cost profiles depending on the model behind it.

some higher cost models are cheaper in the long run due to increased intelligence, but for p50 requests a model like composer 2.5 will do the job both faster and cheaper

a lot of interesting data in this report, recommend a read


wh@nrehiew_

Despite GPT 5.5 having a higher output price than Opus 4.7, Cursor found that Opus 4.7 Agent requests cost almost double GPT 5.5's.

But, normalized for % of lines accepted, both models end up in similar cost territory, which indicates more lines from 5.5 are rejected
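The normalization described in this post can be sketched as cost per accepted line. This is only an illustration of the arithmetic, not Cursor's methodology: the function name, the parameters, and every input number below are hypothetical placeholders, not figures from the report.

```python
def cost_per_accepted_line(cost_per_request: float,
                           lines_proposed: float,
                           acceptance_rate: float) -> float:
    """Dollars spent per line the developer actually keeps.

    All three inputs are assumptions for illustration; the report's
    own definitions may differ.
    """
    if cost_per_request < 0 or lines_proposed <= 0:
        raise ValueError("cost must be >= 0 and lines_proposed > 0")
    if not 0 < acceptance_rate <= 1:
        raise ValueError("acceptance_rate must be in (0, 1]")
    return cost_per_request / (lines_proposed * acceptance_rate)


# Placeholder inputs: model A costs twice as much per request,
# but twice as many of its proposed lines are accepted.
model_a = cost_per_accepted_line(2.0, lines_proposed=100, acceptance_rate=0.8)
model_b = cost_per_accepted_line(1.0, lines_proposed=100, acceptance_rate=0.4)
print(model_a, model_b)
```

With these placeholders the two models land on the same cost per accepted line, which is the shape of the convergence the post describes: a 2x gap in request cost is offset by a 2x gap in acceptance.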


LayBacc@Lay_Bacc

@ericzakariasson the 10X engineer is real


ueaj@_ueaj

@nrehiew_ they should control for task difficulty though, even a vibe-based analysis would suffice


eric zakariasson@ericzakariasson

@stalmico not necessarily. highest cost models are usually more intelligent, meaning that they more often make the correct change on the first try. if they don't, it'll be both slower and more expensive


Dark Stalwart@darkstalwart

@ericzakariasson Can you guys improve support for cursor windows app?


Steven Collard@stalmico

@ericzakariasson so the expensive models finish tasks faster?


K.io@AlguemAi1313

@ericzakariasson this is insane. i dont see opus 8x better than composer 2.5. The composer price is a big deal!


eric zakariasson@ericzakariasson

@Lay_Bacc literally


plainscope@plainscope

@ericzakariasson That 9x cost variation is a good reminder. We've seen similar stuff internally; the 'cheaper' model often needs more human help or re-runs, which quickly eats up any initial savings.


Alex UGift@Radipdegen

@nrehiew_ "output more, accept less" territory is wild

5.5 really out here costing the same for way more words nobody keeps


Matt Boliah@Viby_Matty

@nrehiew_ 2 different strategies, 2 similar results

Interesting. Thanks for the analysis


Dark Stalwart@darkstalwart

@ericzakariasson I can’t scroll with swipe feature when using RDP using windows app - swipe to scroll works for every other app except cursor

When initiating new chat, it shows a weird pop up saying - open “session” (or something like that) with - and gives list of apps like notepad cursor etc.


Guilherme O'Tina@guilhermeotina

@ericzakariasson the 9x variance is real but i think it grows further in agentic loops specifically. a model that fails 20% more doesnt cost 20% more in multi-step workflows. failures cascade into retries, context refills, and downstream rework. the cheap model premium compounds nonlinearly
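One way to see the nonlinearity claimed here is a deliberately crude model, assumed purely for illustration: a workflow has several steps, each step succeeds independently with the same probability, and any failed step forces a full restart. The function name and the sample probabilities are hypothetical, and real agent loops usually retry more selectively than this.

```python
def expected_full_runs(step_success: float, steps: int) -> float:
    """Expected number of end-to-end attempts until one fully succeeds.

    Assumes independent steps and a full restart on any failure, so a
    whole run succeeds with probability step_success ** steps and the
    number of attempts follows a geometric distribution.
    """
    if not 0 < step_success <= 1:
        raise ValueError("step_success must be in (0, 1]")
    if steps < 1:
        raise ValueError("steps must be >= 1")
    return 1.0 / (step_success ** steps)


# Hypothetical per-step success rates for a stronger and a weaker model.
for steps in (1, 5, 10):
    strong = expected_full_runs(0.95, steps)
    weak = expected_full_runs(0.80, steps)
    print(steps, round(weak / strong, 2))
```

Under these assumptions the ratio of expected attempts between the weaker and the stronger model grows with the number of steps, which is the compounding effect the comment points at. The model ignores partial progress and per-step retries, so it likely overstates the gap.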


Rambam the grey@meowbooksj

@nrehiew_ this says more about the average claude enjoyer than on the models


Roubal Sehgal@roubalsehgal

@ericzakariasson using this type of flow at work - we default to lighter models now and only bump up when the task actually needs it

turns out most requests don't need the nuclear option lol and sometimes even auto option in cursor works well
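The "default to lighter models, bump up only when needed" flow described here could look roughly like the sketch below. Everything in it is hypothetical: the tier names, the stub models, and the `accept` check stand in for whatever a team actually uses, and none of it reflects how Cursor's auto option works.

```python
from typing import Callable, Optional, Sequence, Tuple

Model = Callable[[str], str]  # hypothetical: takes a task, returns output


def run_with_escalation(task: str,
                        tiers: Sequence[Tuple[str, Model]],
                        accept: Callable[[str], bool]
                        ) -> Optional[Tuple[str, str]]:
    """Try tiers from cheapest to most expensive and stop at the first
    output that passes the acceptance check (tests, lint, review, ...)."""
    for name, model in tiers:
        result = model(task)
        if accept(result):
            return name, result
    return None  # no tier produced an acceptable result


# Stub models standing in for real API calls.
def light(task: str) -> str:
    return "ok" if len(task) < 20 else "fail"


def heavy(task: str) -> str:
    return "ok"


tiers = [("light", light), ("heavy", heavy)]
print(run_with_escalation("rename a variable", tiers, lambda r: r == "ok"))
print(run_with_escalation("refactor the whole billing module", tiers,
                          lambda r: r == "ok"))
```

The short task is handled by the light tier and the long one escalates to the heavy tier. The quality of the acceptance check matters most in a setup like this, since a weak check lets bad output through at the cheap tier.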


eric zakariasson@ericzakariasson

@darkstalwart yes! how can we improve it?


Steven Collard@stalmico

@ericzakariasson Isnt it why they sometimes have a quick/fast version tho


Eclipse 🌖@ECLresearch

@nrehiew_ So the real metric isn’t raw token cost, but *accepted* lines per dollar — and on that, 5.5’s higher rejection rate pulls it level with Opus 4.7


Atomic Strata@AtomicStrata

@ericzakariasson Efficient agents are not just about cheaper models. They are about smarter memory decisions ✅
