/Tech3h ago

Creator @teortaxesTex argues Western labs struggle with small LLMs because their scaling laws are optimized for trillion-parameter architectures

This counters assertions that raw intelligence drives LLM adoption.

3923472038.9K

#501

Original post

Lisan al Gaib@scaling01#1215inTech

intelligence is all that matters and the largest most expensive models will always be the go to

look at Haiku, Sonnet or OpenAI's mini models

they don't get any love

10:23 AM · Jun 25, 2026 · 10.2K Views

Sentiment

Many users criticize Western AI labs for neglecting smaller models like Sonnet in favor of flagships due to inconsistent performance and lack of care, while some praise specific mini models they find usable.

Pos

30.0%

Neg

70.0%

16 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS13.9KLIKES36

Zephyr@zephyr_z9

Not really They generate far more revenue on the flagships Training small models and serving them for cheap will hurt their topline

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

DSV4-Flash is loved. Gemini 3 Flash is loved. The first Haiku was deemed blessed. o4-mini was great. Yes they don't get as much mythmaking as flagships, but… I think the issue with the Western frontier is that they just *ain't the best* at small. Their scaling laws are for 1T+.

1h13.9K364

BOOKMARKS5RETWEETS1REPLIES6

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

Lisan al Gaib@scaling01

intelligence is all that matters and the largest most expensive models will always be the go to

look at Haiku, Sonnet or OpenAI's mini models

they don't get any love

2h13.5K355

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

@zephyr_z9 I don't know if "serving flagships = more revenue" is a law. The bottleneck is supply. If they could sell 2x more Sonnet tokens than Opus tokens, but Sonnet were 3x cheaper to serve, by this logic they'd go all in on Sonnet. So why isn't this the case? mostly cuz Sonnet is bad

Zephyr@zephyr_z9

Not really They generate far more revenue on the flagships Training small models and serving them for cheap will hurt their topline

1h58342

Lisan al Gaib@scaling01

@teortaxesTex I think labs just don't care about small models

they could train much stronger Sonnet or Mini models but simply don't care enough

it doesn't benefit them in any way

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

2h882110

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

Anthropic in particular. As they get better at Opus and now Fable and Mythos, they clearly abandon the mid- and low range. It didn't have to be like this, but they have a limited bandwidth.

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

2h1K81

Zephyr@zephyr_z9

@teortaxesTex I don't think Sonnet is 3x cheaper to serve

1h1.1K41

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

@zephyr_z9 well that's the problem

1h2054

Zephry@babyzitong

@zephyr_z9 My strategy analysis !

as follows📈📈 👇 👇 👇

1h53

Hong Groyper@Hong60282445

@scaling01 hence sonnet 5 supposedly soon

2h1241

Eis Maus@MrEismaus

@scaling01 I've already got direct access to a stupid person; I don't want to pay for access to stupid models.

2h1091

rabin chhetri@rabinch00606642

@babyzitong @zephyr_z9 very good

1h2

Dustin Tran@dustinvtran

@scaling01 cost matters. but to save cost, it's better to use open weights instead of a closed source lab's second-class citizen

2h234

tokenbender@tokenbender

@scaling01 it’s primarily due to lack of models that can reliably solve a problem for a domain. no hybrid systems that work well outside of agentic scope where delegation is big.

otherwise all harnesses use small models somewhere for reteieval/compaction help.

2h731

Curious Curiousiter@curiousiter

@scaling01 Intelligence also allows for them to ultimately make the best cheapest models if they wanted to, through both distillation and recursive self improvement

2h119

Mr Strijker@mrstrijker

@scaling01 Composer 2.5 proves you wrong.

2h58

Kuroke@kuroke01

@scaling01 intelligence per dollar. It's always about best bang for buck.

1h38

Azyle@AzyleTheCreator

@scaling01 I think Sonnet is loved a lot

2h37

Sheggle@sheggle_

@scaling01 Buddy's never used workflows, or done analysis over 1000s of documents. A lot of AI applications that are just 'automate some closed task xyz' can fare with smaller models just fine.

1h111

MetaCritic “SnapCritic Summer” Capital@MetacriticCap

@scaling01 I use sonnet a lot. Claude uses sonnet a lot

2h33

JesseEvans@JesseEvansce

@scaling01 I love sonnet

2h31