/AI6h ago

Study Shows 71% Of Queries Can Shift From Frontier To Local LLMs

315264.3K
Original postclem 馃#67
Avanika Narayan@Avanika15

100p! in our recent intelligence per watt (ipw) paper, @JonSaadFalcon & i find that 71.3% of real world chat and reasoning queries can be shifted from frontier lms to local lms!

link to ipw paper in comments below 馃憞

Brian Armstrong@brian_armstrong

Good take

My guess is - demand for intelligence is near infinite - but 80% of workloads will be running on 99% cheaper models within 12-18 months - 20% of workloads will still run on latest gen models where IQ maxing is important (scientific breakthroughs, higher level ochestrator agents?) - rough analogy might be what % of macbooks or gaming PCs sold have the maxed out specs for CPU/GPU, prices are falling much faster than Moore's law here though - this leads me to think the limiting factor will be energy and compute, not better models

At Coinbase we're working hard on routing prompts to cheaper models where appropriate, and in some cases have been able to keep costs roughly flat, while token usage continues to grow exponentially.

9:36 AM 路 Jun 8, 2026 路 4.3K Views
Sentiment

Positive users expressed approval for research showing 71% of queries can shift to local LLMs, appreciating the practical benefits highlighted and offering to amplify the findings.

Pos
100.0%
Neg
0.0%
1 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS100LIKES2
clem 馃@ClementDelangue

@Avanika15 @JonSaadFalcon nice, let me try to amplify this!

5hViews 100Likes 2
Avanika Narayan@Avanika15

paper: http://arxiv.org/abs/2511.07885 blogpost: http://hazyresearch.stanford.edu/blog/2025-11-11-ipw

6hViews 48