/Tech21d ago

LangSmith Adds LLM Gateway For AI Agent Cost Governance

2194152218.9K

#698

Original post

Harrison Chase#698

LangChain@LangChain#971inTech

Your agents can burn through $10k overnight before you notice.

LangSmith LLM Gateway stops that.

The platform where you already observe, evaluate, and deploy your agents now has a governance layer.

1:26 PM · May 27, 2026 · 11.3K Views

Sentiment

Many users praise LangChain's LangSmith LLM Gateway cost controls because runtime guardrails and hard caps make agent spending predictable while accelerating development instead of creating friction.

Pos

100.0%

Neg

0.0%

10 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS5.6KBOOKMARKS11LIKES33REPLIES8

Harrison Chase@hwchase17

Cost controls are one of the main benefits of langsmith LLM gateway

We’re seeing in a bunch of use cases adoption is getting to a point where cost really starts to matter

LangChain@LangChain

Your agents can burn through $10k overnight before you notice.

LangSmith LLM Gateway stops that.

The platform where you already observe, evaluate, and deploy your agents now has a governance layer.

21d5.6K3311

RETWEETS4

Caspar Broekhuizen@caspar_br

Governance should lead to more building, not less

Agent development is so much more fun when people don't need to ration tokens because one runaway agent might blow up the bill

Understand spend, cap the scary cases, and teams can experiment much more freely

LangChain@LangChain

Your agents can burn through $10k overnight before you notice.

LangSmith LLM Gateway stops that.

The platform where you already observe, evaluate, and deploy your agents now has a governance layer.

20d2K102

LangChain@LangChain

A TLDR on LangSmith LLM Gateway

https://www.langchain.com/blog/introducing-llm-gateway

21d1.5K61

Andre zamuel@Andrezamue99rr

@LangChain Exponential agent scaling catches most teams off-guard. At what point does manual cost monitoring become unrealistic?

21d105

MITCHELLEX FELLOWSHIP@Mitchellex4God

@LangChain Spend limits + PII redaction + trace continuity at the runtime layer. Finally a governance solution that doesn't slow down agent development. Practical for production teams.

21d3

Sourav Bera@souravxbera

@LangChain Is this available for all LLM providers or just OpenAI/Anthropic for now? The cost governance looks solid.

20d462

Pothu@pothuLabs

@hwchase17 Cost controls are the feature you don't notice until the surprise bill arrives. Most teams find out their agent looped on an expensive model for a week. A gateway with hard caps is cheaper insurance than any monitoring tool.

20d148

a.lihachöv@avlihachev

@hwchase17 cost only bites when someone runs a recursive agent loop on a long doc. one bad MCP tool call can 10x a session. per-session budget caps + prompt caching caught more incidents than any usage dashboard for me.

20d161

Sam Xu@sam_commonly

@hwchase17 Gateway caching/routing controls per-call cost. The harder layer is per-task — an agent loop can run 30+ calls before stopping, and the gateway can't see where the loop should have terminated. Cost-aware budgeting probably belongs in the harness, not the gateway.

21d32

Tiago Rama@tiagobuilds

@LangChain Runaway agents are the boring failure mode nobody screenshots.

Autonomy needs budgets before it needs another demo.

21d31

狐狸布布@baibaida

@hwchase17 been watching this exact problem hit our team, one agentic pipeline went from $200/day to $3k overnight after someone changed a retry loop. the gateway approach makes sense, you need the guardrails at the infra layer not in app code where devs forget to set limits

21d30

Kailash@MikroJaxi

@LangChain the difficult part of AI systems is slowly shifting from intelligence → control cost, observability, permissions, evals, governance

21d26

TravelerOfCode@TravelerOfCode

@hwchase17 the cost arc on every saas eventually hits the 'we actually need limits' moment. agent platforms hit it 18 months earlier than typical saas because per-call cost is wildly variable. gateways are the right primitive here

20d24

Eclipse 🌖@ECLresearch

@hwchase17 Makes sense — observed a few teams where LLM costs quietly ate 30%+ of infra budget post-POC; a proper gateway with routing + caching turns that line item from variable to predictable.

21d24

Stefan Stefanović@proofmancer

agreed. spend is just the first axis though. the scarier runaway isn't the bill, it's not knowing what the agent actually did. cap the spend, attest the actions, and you can finally let agents off the leash. it's why we build iris with the wallet at the org and a receipt for every call.

20d13

Pablo Pablo@navtechai

@hwchase17 The change isn't the policy tool itself. It's that deployments have scaled enough for cost to become a real production constraint.

20d13

char@pablogavipedri

@Andrezamue99rr @LangChain Exactly! This is why early intervention matters. What threshold makes automated cost governance a must-have?

21d7

char@pablogavipedri

@Mitchellex4God @LangChain Runtime governance shouldn't feel like friction. Kudos to LangChain for proving it can accelerate development rather than hinder it.

21d4

Strata@ChainZenit

@hwchase17 cost control always ends up being the real bottleneck eventually

21d3

Sven Nachtzeit@SvenUrbanSci

@caspar_br Precisely. Token rationing kills iteration. Hard caps at the gateway let me focus on agent performance instead of spend risk.

20d2