/AI23h ago

Ramp's Eric Glyman says top AI spenders doubled revenue, warning of 300x frontier model cost increases by 2026

GPT-4 level models are projected to become 700x cheaper.

8771761478231K

#67

Original post

clem 🤗@ClementDelangue#67inAI

@eglyman yes! multi-model (especially open-source and smaller/faster/cheaper specialized models) for the win!

Eric Glyman@eglyman

As I wrote this, I saw X go into meltdown over tokens.

You've seen the headlines: “Uber blows yearly AI budget in just one quarter.” “Meta employee burns 281 billion tokens in April.”

But, the problem isn't spending. Spending works. Since 2023, the top quartile of our AI spenders doubled their revenue. The bottom quartile? Flat.

It's blind spending. We don’t know which spend worked.

A sales team has qualified leads. A support team has resolved conversations. These are units you can measure against. All a token tells you is the meter ran, not whether the work was worth it or not.

Finance says, “half the budget,” engineering says, “double it” and you don’t know who’s right because there is no shared language of value. There’s no attribution, and no attribution means no allocation.

For example, right now, all work, no matter the size or shape, defaults to frontier models. But meeting summaries and calendar updates don’t require GPT-5.5 Pro.

In isolation this seems trivial, but re-route just 10% of a $10M AI bill from frontier to GPT-4 level intelligence you’ve saved nearly one million dollars. This sounds like a made-up stat — it’s not. It truly is that much cheaper.

This is the future of finance: not blindly rubber-stamping or rejecting AI spend, but allocating it with the same rigor companies apply to headcount.

9:06 AM · Jun 5, 2026 · 2K Views

Sentiment

Positive users endorse Ramp's smarter AI spend routing and multi-model approaches for efficiency gains, while negative users dismiss token-burning models as economically senseless and overhyped.

Pos

60.0%

Neg

40.0%

10 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS1.3K

Pranav@IamPranavJ

@eglyman there's an energy version of this. every frontier query is more gpu-seconds and more watts. routing a calendar update to gpt-5.5 pro burns real power to do what a 3B model handles fine. allocating spend by need is allocating energy by need.

22h1.3K2

BOOKMARKS9LIKES11

Ed Sim@edsim

@eglyman @JaredSleeper 🎯 the world will move to ROI per workflow

wrote more about this here last week:

https://www.whatshotit.vc/p/whats-in-enterprise-itvc-500

23h654119

RETWEETS1

Tomas Hernando Kofman@tomas_hk

@eglyman Yes there is a lot we can do, especially multi-provider and OSS. Let's talk @eglyman! Been working on intelligent model routing (gateway agnostic) for 2 years: http://notdiamond.ai

13h2043

REPLIES1

Chat Joglekar@joglekar

@eglyman I'm surprised you aren't building a Router to help Ramp customers save on AI spend, similar to how you started on credit cards by sharing the interchange with us. A kick ass router that sends most (if not all) of the savings to the customer would be amazing @eglyman

21h3281

arian ghashghai@arian_ghashghai

@eglyman This tacitly implies that the software business model doesn’t really work for AI (since its outputs are non-deterministic)

22h1.1K91

Azeem Azhar@azeem

@eglyman Yep - and that allocation discipline is likely to increase the market size https://www.exponentialview.co/p/does-pricing-shrink-or-expand-markets?r=f84r&utm_campaign=post&utm_medium=web

21h67022

Rupert Davies@HumanTechGuy

@eglyman "Tokens are just dollars, agents are just hires." No. A hire can reason. A hire exercises judgment. A hire can refuse. An agent is a stochastic process with a meter attached. The conflation is either naive or convenient.

21h4508

Harry Dry@harrydry

@eglyman “This is the future of finance: not blindly rubber-stamping or rejecting AI spend, but allocating it with the same rigor companies apply to headcount.”

23h58910

Crepe Supreme@crepesupreme

@eglyman The attribution gap is real, and there's a layer under it. Reasoning tokens bill at output rates; agentic runs burn ~1,000x a simple call, and neither maps cleanly to attribution tooling. That's why the bill keeps climbing as per-token costs fall. Thread:

17h1.2K1

cory pisano@corypisano

@eglyman > re-route just 10% of a $10M AI bill from frontier to GPT-4 level intelligence you’ve saved nearly one million dollars. This sounds like a made-up stat — it’s not.

This doesn’t just sound like ai slop — it is.

19h4467

John Shearin@jshearin01

@eglyman People don't realize that today's 4B models can run locally on a cell phone and have better than GPT-4o

22h361

先手 · Ahead@yangyue992125

@eglyman 这因果反了吧——不是砸钱AI让收入翻倍，是本来就在涨的公司才烧得起这个钱；优步一个季度烧完年度预算，是因为它打车订单本来就在涨，换个增长停滞的公司同样砸2810亿token，砸出来的只有一张更难看的财报。

12h171

Jacob Shi@Jacoob_shi

@eglyman the 'which spend worked' question is so underrated. everyone stares at the bill not the roi lol

22h6693

Ben Schaechter@Bensign

@eglyman A problem which Vantage is already helping with quite a bit as well :)

http://www.vantage.sh

23h4012

Clawdtalk@clawdtalk

@eglyman The frame is right but misses why companies overspend: the cost of falling behind is invisible and unbounded. You can measure what you spent. You can't measure what you lost by not spending. Until counterfactual attribution exists, rational CFOs will over-spend.

23h9581

Branch@BranchM

@eglyman A huge zone opportunity for routers and more

19h903

mass@Memetic_Theory

@eglyman The future of finance is not allowing blind deployment of capital. It’s optimizing deployment given exposure and an uncertain world. This is what we do.

15h4131

KippBodnar.eth@kippbodnar

@eglyman This is the right take

23h3981

Artem Mashkov@Artem_Mashkov

@eglyman typically finance is very scared of what we can’t measure over time. Hopefully the attribution issue resolves before it sours the innovation.

23h3981

Louis Amira@louisamira

@eglyman Indeed

23h2721