/AI23h ago

Ramp's Eric Glyman says top AI spenders doubled revenue, warning of 300x frontier model cost increases by 2026

GPT-4 level models are projected to become 700x cheaper.

8771761478231K
Original post
clem 🤗@ClementDelangue#67inAI

@eglyman yes! multi-model (especially open-source and smaller/faster/cheaper specialized models) for the win!

Eric Glyman@eglyman

As I wrote this, I saw X go into meltdown over tokens.

You've seen the headlines: “Uber blows yearly AI budget in just one quarter.” “Meta employee burns 281 billion tokens in April.”

But, the problem isn't spending. Spending works. Since 2023, the top quartile of our AI spenders doubled their revenue. The bottom quartile? Flat.

It's blind spending. We don’t know which spend worked.

A sales team has qualified leads. A support team has resolved conversations. These are units you can measure against. All a token tells you is the meter ran, not whether the work was worth it or not.

Finance says, “half the budget,” engineering says, “double it” and you don’t know who’s right because there is no shared language of value. There’s no attribution, and no attribution means no allocation.

For example, right now, all work, no matter the size or shape, defaults to frontier models. But meeting summaries and calendar updates don’t require GPT-5.5 Pro.

In isolation this seems trivial, but re-route just 10% of a $10M AI bill from frontier to GPT-4 level intelligence you’ve saved nearly one million dollars. This sounds like a made-up stat — it’s not. It truly is that much cheaper.

This is the future of finance: not blindly rubber-stamping or rejecting AI spend, but allocating it with the same rigor companies apply to headcount.

9:06 AM · Jun 5, 2026 · 2K Views
Sentiment

Positive users endorse Ramp's smarter AI spend routing and multi-model approaches for efficiency gains, while negative users dismiss token-burning models as economically senseless and overhyped.

Pos
60.0%
Neg
40.0%
10 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS1.3K
Pranav@IamPranavJ

@eglyman there's an energy version of this. every frontier query is more gpu-seconds and more watts. routing a calendar update to gpt-5.5 pro burns real power to do what a 3B model handles fine. allocating spend by need is allocating energy by need.

22hViews 1.3KLikes 2
BOOKMARKS9LIKES11
Ed Sim@edsim

@eglyman @JaredSleeper 🎯 the world will move to ROI per workflow

wrote more about this here last week:

https://www.whatshotit.vc/p/whats-in-enterprise-itvc-500

23hViews 654Likes 11Bookmarks 9
RETWEETS1

@eglyman Yes there is a lot we can do, especially multi-provider and OSS. Let's talk @eglyman! Been working on intelligent model routing (gateway agnostic) for 2 years: http://notdiamond.ai

13hViews 204Likes 3
REPLIES1
Chat Joglekar@joglekar

@eglyman I'm surprised you aren't building a Router to help Ramp customers save on AI spend, similar to how you started on credit cards by sharing the interchange with us. A kick ass router that sends most (if not all) of the savings to the customer would be amazing @eglyman

21hViews 328Likes 1
arian ghashghai@arian_ghashghai

@eglyman This tacitly implies that the software business model doesn’t really work for AI (since its outputs are non-deterministic)

22hViews 1.1KLikes 9Bookmarks 1

@eglyman Yep - and that allocation discipline is likely to increase the market size https://www.exponentialview.co/p/does-pricing-shrink-or-expand-markets?r=f84r&utm_campaign=post&utm_medium=web

21hViews 670Likes 2Bookmarks 2
Rupert Davies@HumanTechGuy

@eglyman "Tokens are just dollars, agents are just hires." No. A hire can reason. A hire exercises judgment. A hire can refuse. An agent is a stochastic process with a meter attached. The conflation is either naive or convenient.

21hViews 450Likes 8
Harry Dry@harrydry

@eglyman “This is the future of finance: not blindly rubber-stamping or rejecting AI spend, but allocating it with the same rigor companies apply to headcount.”

23hViews 589Likes 10
Crepe Supreme@crepesupreme

@eglyman The attribution gap is real, and there's a layer under it. Reasoning tokens bill at output rates; agentic runs burn ~1,000x a simple call, and neither maps cleanly to attribution tooling. That's why the bill keeps climbing as per-token costs fall. Thread:

17hViews 1.2KBookmarks 1
cory pisano@corypisano

@eglyman > re-route just 10% of a $10M AI bill from frontier to GPT-4 level intelligence you’ve saved nearly one million dollars. This sounds like a made-up stat — it’s not.

This doesn’t just sound like ai slop — it is.

19hViews 446Likes 7
John Shearin@jshearin01

@eglyman People don't realize that today's 4B models can run locally on a cell phone and have better than GPT-4o

22hViews 36Bookmarks 1
先手 · Ahead@yangyue992125

@eglyman 这因果反了吧——不是砸钱AI让收入翻倍,是本来就在涨的公司才烧得起这个钱;优步一个季度烧完年度预算,是因为它打车订单本来就在涨,换个增长停滞的公司同样砸2810亿token,砸出来的只有一张更难看的财报。

12hViews 17Bookmarks 1
Jacob Shi@Jacoob_shi

@eglyman the 'which spend worked' question is so underrated. everyone stares at the bill not the roi lol

22hViews 669Likes 3

@eglyman A problem which Vantage is already helping with quite a bit as well :)

http://www.vantage.sh

23hViews 401Likes 2
Clawdtalk@clawdtalk

@eglyman The frame is right but misses why companies overspend: the cost of falling behind is invisible and unbounded. You can measure what you spent. You can't measure what you lost by not spending. Until counterfactual attribution exists, rational CFOs will over-spend.

23hViews 958Likes 1
Branch@BranchM

@eglyman A huge zone opportunity for routers and more

19hViews 90Likes 3
mass@Memetic_Theory

@eglyman The future of finance is not allowing blind deployment of capital. It’s optimizing deployment given exposure and an uncertain world. This is what we do.

15hViews 413Likes 1
KippBodnar.eth@kippbodnar

@eglyman This is the right take

23hViews 398Likes 1
Artem Mashkov@Artem_Mashkov

@eglyman typically finance is very scared of what we can’t measure over time. Hopefully the attribution issue resolves before it sours the innovation.

23hViews 398Likes 1
Louis Amira@louisamira

@eglyman Indeed

23hViews 272Likes 1
Load more posts