/AI2h ago

AI Models Skip Web Searches and Give Up Sooner to Cut Token Costs

5219865128.3K
Original post unavailable.
Sentiment

Many users criticize AI models for skipping web searches and giving up sooner to cut token costs, as this causes degraded quality, less effort, and reduced reliability.

Pos
11.8%
Neg
88.2%
18 comments with sentiment.
Cluster Engagement
Posts from X
Most Activity
Most Activity
VIEWS1.8KBOOKMARKS1LIKES10RETWEETS1REPLIES5
Bill Gurley@bgurley

Maybe we are moving to the "optimization" phase of the AI game. Lots of things change in this phase. In the dot-com boom, all website were built on Oracle+Sun. Five years later all MySQL + Linux. Eventually the margins matter.

2hViews 1.8KLikes 10Bookmarks 1
Tyler Palmer@tyler__palmer

@bgurley So they get lazy over time… just like us

2hViews 22Likes 2

@bgurley Did you see @thefriley interview on all-in? Recommend

2hViews 26Likes 1
Anil Murty ⟁@anilmurty_ai

@bgurley As a data point - I analyzed my role spend from using Claude code with a Max-5x plan ($100/mo), for the last 30 days, if I was paying for usage (like api calls) and it was almost $5K. And I’m not even the greatest TokenMaxxer around :)

2hViews 19Likes 1
Tech P@Tech_p001

@bgurley "We went from maximizing accuracy to optimizing compute budgets, and the user experience is paying the price. LLMs aren't getting dumber; they're being engineered to 'give up' early because web searches and deep reasoning loops are too expensive at scale."

2hViews 31

@bgurley of course ... and the control on this without people churning is the margin for these businesses ... the lack of framework for what to spend and how to evaluate an answer is the margin (just like ads a generation ago)

2hViews 279Likes 1

@bgurley @thefriley A significant deflationary curve in compute costs, she mentioned that from GPT-4 to GPT-5.4, the cost of tokens dropped by approximately 97%

And they are highly focused on costs for inference

It’s a good watch https://youtu.be/TjrShuj_Zsg?si=0iE_4XBfJypPg3bX

1hViews 158Likes 1
Heidi Legg@heidilegg

The simple fact that AI can create an amazing deck for me, but won’t allow me to download it into Google slides or Ppt (if anyone still uses that) so I can edit it and use it (rather than staring at like something I desperately want in a window shop but can’t buy at any cost) convinces me that AI is still not there.

1hViews 2
Billy J. Hatler@billyjhatler

@bgurley I am having a similar experience Bill. Recently, I’ve had longer chats to get to good outcomes and almost have to coax the model to get to those outcomes. 🤘🏻

1hViews 62Likes 1
X Girls@thesoragirls

@bgurley I have especially noticed this with Gemini models.

2hViews 53Likes 1

@bgurley Meaning they’re deliberately avoiding web search for recency, even when prompted?

2hViews 35Likes 1
Aditya Mehrotra@AdityaKMehrotra

@bgurley Which models have you noticed this with? I’ve seen the same laziness with Sonnet lately, but not with the new Opus 4.8

Could be plan or query dependent

2hViews 33Likes 1
Bill Gurley@bgurley

@tyler__palmer 🤣🤣🤣

2hViews 25Likes 1
Mike Scroggins@SignalChainMike

@bgurley Yeah I agree, the Claude family has had a weird habit of telling you to go to bed or do other things. It definetly seems like their is some effort on its part to discourage people using it for long periods. Could be for mental health, but the cynical read is that its reduce cost.

2hViews 79
Subramanya N@subramanya

@bgurley yeah. this is the part users will notice first: not the bill, but the model quietly doing less work. evals need to catch effort regression, not just wrong answers.

2hViews 62
BlackJack@BlackJackPartII

@bgurley Interesting and makes sense. I’ve noticed the same thing recently on Claude. It pushes back on researching topics and is more active in shaping our discussions.

2hViews 17Likes 1
Nathan Quantum@AI_WarriorNQ

@bgurley noticed this too. reasoning tokens getting trimmed. models aren't dumber they're just being told to think less

1hViews 53
Taikhoom Sojitrawala@TSojitrawa5253

@bgurley Yeah probably because they want their s-1 to be more appealing to investors.

2hViews 47
Kim Benabib@KimBenabib

@bgurley Are you sure you're using Gemini? Don't think I've ever seen Gemini NOT do web search.

2hViews 43
Gavin Tan@MoatOwl

@bgurley I noticed this under Opus 4.8 vs 4.7. Same prompts for company deep dives, output is 10-15% shorter with less effort put in.

1hViews 12Likes 1
Load more posts