/AI2h ago

AI Models Skip Web Searches and Give Up Sooner to Cut Token Costs

5219865128.3K

Original post unavailable.

/AI2h ago

AI Models Skip Web Searches and Give Up Sooner to Cut Token Costs

5219865128.3K

Original post unavailable.

Sentiment

Many users criticize AI models for skipping web searches and giving up sooner to cut token costs, as this causes degraded quality, less effort, and reduced reliability.

Pos

11.8%

Neg

88.2%

18 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS1.8KBOOKMARKS1LIKES10RETWEETS1REPLIES5

Bill Gurley@bgurley

Maybe we are moving to the "optimization" phase of the AI game. Lots of things change in this phase. In the dot-com boom, all website were built on Oracle+Sun. Five years later all MySQL + Linux. Eventually the margins matter.

2h1.8K101

Tyler Palmer@tyler__palmer

@bgurley So they get lazy over time… just like us

2h222

Giuliano Giacaglia@giacaglia

@bgurley Did you see @thefriley interview on all-in? Recommend

2h261

Anil Murty ⟁@anilmurty_ai

@bgurley As a data point - I analyzed my role spend from using Claude code with a Max-5x plan ($100/mo), for the last 30 days, if I was paying for usage (like api calls) and it was almost $5K. And I’m not even the greatest TokenMaxxer around :)

2h191

Tech P@Tech_p001

@bgurley "We went from maximizing accuracy to optimizing compute budgets, and the user experience is paying the price. LLMs aren't getting dumber; they're being engineered to 'give up' early because web searches and deep reasoning loops are too expensive at scale."

2h31

sam lessin 🏴‍☠️@lessin

@bgurley of course ... and the control on this without people churning is the margin for these businesses ... the lack of framework for what to spend and how to evaluate an answer is the margin (just like ads a generation ago)

2h2791

Giuliano Giacaglia@giacaglia

@bgurley @thefriley A significant deflationary curve in compute costs, she mentioned that from GPT-4 to GPT-5.4, the cost of tokens dropped by approximately 97%

And they are highly focused on costs for inference

It’s a good watch https://youtu.be/TjrShuj_Zsg?si=0iE_4XBfJypPg3bX

1h1581

Heidi Legg@heidilegg

The simple fact that AI can create an amazing deck for me, but won’t allow me to download it into Google slides or Ppt (if anyone still uses that) so I can edit it and use it (rather than staring at like something I desperately want in a window shop but can’t buy at any cost) convinces me that AI is still not there.

1h2

Billy J. Hatler@billyjhatler

@bgurley I am having a similar experience Bill. Recently, I’ve had longer chats to get to good outcomes and almost have to coax the model to get to those outcomes. 🤘🏻

1h621

X Girls@thesoragirls

@bgurley I have especially noticed this with Gemini models.

2h531

Parvesh Deosarran 🐢@PADeosarran

@bgurley Meaning they’re deliberately avoiding web search for recency, even when prompted?

2h351

Aditya Mehrotra@AdityaKMehrotra

@bgurley Which models have you noticed this with? I’ve seen the same laziness with Sonnet lately, but not with the new Opus 4.8

Could be plan or query dependent

2h331

Bill Gurley@bgurley

@tyler__palmer 🤣🤣🤣

2h251

Mike Scroggins@SignalChainMike

@bgurley Yeah I agree, the Claude family has had a weird habit of telling you to go to bed or do other things. It definetly seems like their is some effort on its part to discourage people using it for long periods. Could be for mental health, but the cynical read is that its reduce cost.

2h79

Subramanya N@subramanya

@bgurley yeah. this is the part users will notice first: not the bill, but the model quietly doing less work. evals need to catch effort regression, not just wrong answers.

2h62

BlackJack@BlackJackPartII

@bgurley Interesting and makes sense. I’ve noticed the same thing recently on Claude. It pushes back on researching topics and is more active in shaping our discussions.

2h171

Nathan Quantum@AI_WarriorNQ

@bgurley noticed this too. reasoning tokens getting trimmed. models aren't dumber they're just being told to think less

1h53

Taikhoom Sojitrawala@TSojitrawa5253

@bgurley Yeah probably because they want their s-1 to be more appealing to investors.

2h47

Kim Benabib@KimBenabib

@bgurley Are you sure you're using Gemini? Don't think I've ever seen Gemini NOT do web search.

2h43

Gavin Tan@MoatOwl

@bgurley I noticed this under Opus 4.8 vs 4.7. Same prompts for company deep dives, output is 10-15% shorter with less effort put in.

1h121