/AI6d ago

Copilot Token Billing Drains Credits Fast, Triggering User Cancellations

2172K2242.1K1.5M
Original postclem 🤗#67
Tommy@Shaughnessy119

The most basic way AI could blow up imo. I'm not saying it does but this is the most obvious way I can see it happening

- Per seat subscriptions are massively subsidized. The flat fee was priced way below what heavy usage actually costs

- For real business use you have to move to the API anyway. Data protections, work integrations and compliance officer approval

- On the API you pay metered rates, and businesses are burning credits way faster than the per seat pricing ever led them to expect

- This is everywhere right now. Internally for us, Codex users, Uber torching its entire 2026 AI budget in 4 months, the Microsoft comments. Just go try an API

I shared more on this here:

- And I don't think most businesses have the money to keep paying increasing API rates without a real change to how they operate (caps needed)

- Because they have a cheap alternative. They can reach open source models through any aggregator (OpenRouter, Venice, Baseten, Together) and still get strong privacy. Venice private data centers, or E2EE/TEE serving GLM 5.1.

More on open source inference provider raises here:

- And the discount is enormous. DeepSeek V4 codes within a hair of Opus on SWE bench at roughly 1/30th the price, and the cheapest open models run closer to 1/100th

- Chinese labs open source frontier grade models. The model is the single biggest cost an inference provider has, and they get it for free

- This idea dies if China goes closed source. That is actually bullish web2 AI labs, because if everyone is closed you pay up for the best intelligence. China goes closed source if they are tired of giving away an asset and they want the revenue and data flow to train new models

- Is this showing up in web2 AI lab revenue yet? No. Revenue is off the charts. Anthropic went from 9B to 47B run rate in five months

- So go forward, what happens?

- I think revenue slowly starts leaking to the open source inference providers (see Venice usage, OpenRouter's $113M raise, Baseten is raising at $11B or triple its valuation in three months, on revenue that went from $200M to $600M annualized in a single quarter)

- It doesnt move overnight, but it caps the labs ability to raise prices, and margins are already deeply negative. OpenAI is reportedly running near negative 122%

- With margins that bad there is no cash flow, so the labs are fully dependent on outside capital to buy GPUs, train models, and keep subsidizing usage (I.e. see Google tapping $80b equity sale, granted 30b for employee RSU taxes. Clearly they think Equity is overvalued or you wouldn't sell it)

- The break comes when that capital stops. Pricing is capped so margins cant improve, and the moment investors lose conviction on payback, the whole flow reverses

- Why would they lose conviction on payback? Back to the start - the inability to improve margins or get businesses to pay more

- This is also limiting, if we start making new drugs with AI or create entirely new businesses, you better believe people will pay up to the max for AI usage

6:49 AM · Jun 2, 2026 · 1.5M Views
Sentiment

Many users agree Copilot's token billing drains credits too fast and triggers cancellations, while others dismiss those arguments as biased or unserious about open-source models.

Pos
66.7%
Neg
33.3%
21 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS40.2K
Tommy@Shaughnessy119

another one @Altimor

4dViews 40.2KLikes 38Bookmarks 23
BOOKMARKS58
Tommy@Shaughnessy119

@Altimor The trend is clear

1dViews 25KLikes 60Bookmarks 58
LIKES143RETWEETS6REPLIES14
Ansem@blknoiz06

@Shaughnessy119 open source ai models competing with closed source on usability/effectiveness is the biggest risk to big labs profitability & also the strongest unexpected tailwind for crypto x ai protocols connecting those new revenue sources back to a token

6dViews 16.1KLikes 143Bookmarks 27
Soren Larson@hypersoren

@Shaughnessy119 Characterized lab value capture / margins under this dynamic this analytically - you might enjoy

6dViews 3.3KLikes 6Bookmarks 9
nicolas, 30 ans@nic_carter

@Shaughnessy119 Very insightful Tommy, fully agree

Not existential for AI but will be a very rough transition period moving from subsidized per seat to metered usage

6dViews 8.6KLikes 32Bookmarks 4
Tommy@Shaughnessy119

@AlgodTrading Agreed, that is partially @ambient_xyz

Less so distillation moreso a chain that provides leading open source models and their fine tunes

6dViews 5KLikes 4Bookmarks 3
Algod@AlgodTrading

@Shaughnessy119 A chain that allows distillation and fine tuning of SOTAS to create cost effective OS models while simultaneously secure the chain

Would be much more erricient efficient than BTC which is just wasting a shit ton of energy for no good reason

6dViews 8KLikes 19
Jay@jayendra_jog

@Shaughnessy119 > I think revenue slowly starts leaking to the open source inference providers

do you have anything substantial to back up the idea that businesses doing high volume are comfortable using "worse" models if their competitors are using SOTA closed source models

6dViews 4KLikes 11Bookmarks 1

@Shaughnessy119 Well said and I think the AI cost issue comes to a head in the 2nd half of this year

6dViews 2.4KLikes 9Bookmarks 1
nicolas, 30 ans@nic_carter

@Shaughnessy119 This is all very bullish for open weight AI and their interfaces

6dViews 1.4KLikes 7Bookmarks 1
Tommy@Shaughnessy119

@blknoiz06 Agreed 100%

6dViews 9KLikes 12
Peter Yang@petergyang

@Shaughnessy119 China is probably not going closed source. Having most of US companies run on Chinese models is a competitive advantage.

22hViews 970Likes 5Bookmarks 1
kache@yacineMTB

@Shaughnessy119 Yeah

23hViews 941Likes 5Bookmarks 1
Conor@jconorgrogan

@Shaughnessy119 What happens is we are going to need more datacenters

6dViews 2.7KLikes 8
Yunc@YuncMeme

@blknoiz06 @Shaughnessy119 A yunc doesn’t understand what a big yunc cooks in the kitchen, but one thing is sure: going to be Habibi.

6dViews 24Likes 5

@Shaughnessy119 Gemma 4 was the biggest gift the west gave to China. They finally could crack their failing agentic capability with CoT. Once Gemma 4 came out, finally Chinese models were standing toe to toe with the Western LLMs on FoodTruckBench as the CoT isn't faked.

5dViews 629Likes 1Bookmarks 1
Tommy@Shaughnessy119

Not really right now. My assumption is just that the centralized APIs are really expensive, businesses use them given the data retention guarantees/someone to sue, some are cutting back and you can get a private offering at 1/100 the price for most business use cases. Its a theory for the future, not something happening right now.

6dViews 3.1KLikes 3Bookmarks 1
Ronan@Ronangmi

@Shaughnessy119 agree on potential for OS model adoption to eat at mrkt share via: broader recognition many tasks dont require frontier pref + enterprise cost crunch

but in current form - and particularly for agents tasks (higher input:output ratios) - are margins not improving?

3dViews 780Likes 3Bookmarks 1
Load more posts