Tech · 1d ago

Claude API outage triggers reports of cross-tenant leaks due to suspected KV cache indexing errors

AI Judge changed title after evaluation, original title: "LLM inference bug exposing customer generation outputs prompts debate over KV cache isolation and routing errors"

Anthropic has not officially verified whether customer data leaked.


Original post

Matthew Berman@MatthewBerman · #1088 in Tech

Should I make http://willclaudequotareset.com?

11:06 AM · Jun 5, 2026 · 9.4K Views

Sentiment

Users criticized AI API providers for repeated data leaks exposing customer information, viewing them as evidence of misplaced priorities and as a reason to distrust cloud services.

Pos 0.0% · Neg 100.0% · 7 comments with sentiment.

Cluster Engagement

Posts from X

Most Activity

VIEWS 76.8K · BOOKMARKS 93 · LIKES 706 · RETWEETS 17

kalomaze@kalomaze

ahahaha oh man this is the scariest kind of kv cache bug

23h · 76.8K views · 706 likes · 93 bookmarks

REPLIES 27

Chubby♨️@kimmonismus

Reports claim Claude’s API may have returned another user’s inference output during today’s outage.

Anthropic’s status page confirms elevated errors affecting Claude API, Claude Code, Claude.ai and Claude Cowork, but Anthropic has not confirmed a customer data leak yet.

That would be a cross-tenant isolation failure and would be a worst-case scenario.

21h25.1K17435

Beff (e/acc)@beffjezos

The downside of serving batches of customers on the same device is that you're one indexing error away from accessing others' output tokens

In the end, people will want private personal compute for inference. Lower latency and more power efficient.

20h10.1K10714
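
To make the "one indexing error away" point concrete, here is a toy sketch, not a description of any provider's serving stack. It assumes a batched server keeps per-request outputs in one shared buffer and looks them up by slot index. The names `Slot`, `build_batch`, `read_unchecked`, and `read_checked` are hypothetical, and the `offset` parameter only simulates an indexing bug.

```python
from dataclasses import dataclass


@dataclass
class Slot:
    owner: str          # request id the slot was assigned to (hypothetical field)
    tokens: list[str]   # output tokens generated for that request


def build_batch(requests: dict[str, list[str]]) -> tuple[list[Slot], dict[str, int]]:
    """Pack per-request outputs into one shared buffer plus an index of slot numbers."""
    slots, index = [], {}
    for i, (req_id, tokens) in enumerate(requests.items()):
        slots.append(Slot(owner=req_id, tokens=tokens))
        index[req_id] = i
    return slots, index


def read_unchecked(slots, index, req_id, offset=0):
    """Return tokens by slot index alone; `offset` simulates an indexing bug."""
    return slots[(index[req_id] + offset) % len(slots)].tokens


def read_checked(slots, index, req_id, offset=0):
    """Same lookup, but refuse to return a slot owned by a different request."""
    slot = slots[(index[req_id] + offset) % len(slots)]
    if slot.owner != req_id:
        raise PermissionError(f"slot owned by {slot.owner!r}, not {req_id!r}")
    return slot.tokens


batch = {"tenant-a/req-1": ["hello", "A"], "tenant-b/req-7": ["secret", "B"]}
slots, index = build_batch(batch)
```

With `offset=1`, `read_unchecked` hands tenant A the tokens stored for tenant B, while `read_checked` raises instead. An ownership check at read time does not prevent the indexing bug, but in this sketch it turns a silent leak into a visible error.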

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

Incredible how often this happens even to "serious" labs. Seen it with DeepSeek (long ago tbh, they're very good at cache now), Grok, and now Claude?

19h4.4K390

Jake@JakeKAllDay

@kalomaze "a new, dynamic shared KV cache cutting inference costs 99%"

19h475192

Victor Kurako@kurakovictor

@kalomaze They said it was “hallucinations” publicly lmao

20h7999

Roko ʕ •ᴥ•ʔっ🪄✨🐍@rokobasili

@kalomaze @DanielleFong uh oh spaghetti o @vGPUArthur

23h9521

William@robot__fan

@kalomaze do you think its more likely that youre actually getting someone elses exact kv cache (ie "leaking" data as in the qrt) or just some perturbation/combination that makes it look like that, but youre decoding not garbage per se but from a kv cache state that doesnt actually exist?

19h6814

ThomAub@ThomAub

@kalomaze Cache invalidation or off by one error?

22h1.3K6

Paweł J Lisowski@PawelJLisowski

@kimmonismus Why do they keep having issues like this while having unlimited access to mythos? Makes me wonder how much of that hype is real.

21h1223

Louis Mullie, MD@LouisMullie

@robot__fan @kalomaze completely junk kv would produce more degenerate outputs than what is shown

this indeed looks like kv cache bleeding across request boundaries

much easier than people realize for this to happen by accident

honestly bad look to call this ‘hallucinations’ with a straight face

19h683

Jake@jacobrhinehart

@kimmonismus I know we say this weekly, but next week is pivotal imo

21h394

brian.fm@brianfm_the

@kimmonismus play stupid games, win stupid prizes.

all that rich request body inspection and routing kerfuffle after openclaw and friends stressed capacity constraints could very well be causing pain?

20h1151

Chubby♨️@kimmonismus

@jacobrhinehart somehow every week feels pivotal

21h362

Guilherme O'Tina@guilhermeotina

yeah this is the radix attention nightmare. shared prefix caches partition blocks by request ids, and crash recovery scrambles that mapping. the worst part: it generates fluent output from the wrong context, so you need a user noticing 'hey this is someone else's conversation' to catch it

20h6753
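
The failure mode described in the post above (a request-to-block mapping that gets scrambled, followed by fluent decoding from the wrong context) can be illustrated with a toy model. This is a sketch under stated assumptions, not how any real prefix cache is implemented: `BlockPool`, `prefix_hash`, and the string payloads are hypothetical stand-ins, and the "recovery" step is simulated by swapping two table entries by hand.

```python
import hashlib


def prefix_hash(tokens):
    """Stable digest of a token prefix, used here as a block's identity."""
    return hashlib.sha256("\x1f".join(tokens).encode()).hexdigest()


class BlockPool:
    """Toy KV block pool: each block records the digest of the prefix it caches."""

    def __init__(self):
        self.blocks = []  # list of (digest, payload) pairs

    def store(self, tokens):
        payload = "kv-for:" + " ".join(tokens)  # placeholder for real KV tensors
        self.blocks.append((prefix_hash(tokens), payload))
        return len(self.blocks) - 1  # block id

    def load(self, block_id, tokens):
        """Return the block only if it was built from the caller's own prefix."""
        digest, payload = self.blocks[block_id]
        if digest != prefix_hash(tokens):
            raise LookupError("block does not match the requesting prefix")
        return payload


pool = BlockPool()
alice = ["system", "alice", "notes"]
bob = ["system", "bob", "invoice"]
table = {"req-a": pool.store(alice), "req-b": pool.store(bob)}

# Simulate a recovery step that rebuilds the request -> block mapping wrongly.
scrambled = {"req-a": table["req-b"], "req-b": table["req-a"]}
```

Here `pool.load(scrambled["req-a"], alice)` raises `LookupError` because the block's recorded digest belongs to a different prefix. Without such a content check, nothing in the lookup itself would signal that the wrong context was attached.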

Ben (no treats)@andersonbcdefg

@kalomaze dang cant believe mythos let this one slip through

18h3032

William@robot__fan

@LouisMullie @kalomaze well yeah. obv hallucinations is wrong too, but wouldnt a kv thats a concat / wrong window of two (k?) valid non-junk kvs potentially produce results that are relevant to neither yet still sound coherent?

17h23

Old Billy PhD (Player Hater Degree)@realOldBilly

@kalomaze @DanielleFong Haha I wonder if part of the cache key is unset or something

22h1.2K
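
The "part of the cache key is unset" guess can also be sketched. The example below is hypothetical throughout (`CacheKey`, `make_key`, and the `strict` flag are illustrative names, and nothing here is confirmed about the actual incident). It shows how two tenants sending the same prefix collide on one cache entry when the tenant component of the key is missing.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class CacheKey:
    tenant_id: Optional[str]   # None models the "unset" key component
    prefix: tuple[str, ...]    # token prefix the cached state was built from


def make_key(tenant_id, prefix, strict=True):
    """Build a cache key; in strict mode, reject keys with no tenant scope."""
    if strict and not tenant_id:
        raise ValueError("tenant_id must be set before touching the cache")
    return CacheKey(tenant_id, tuple(prefix))


cache = {}

# Buggy path: tenant A writes with an unset tenant id ...
cache[make_key(None, ["summarize", "this"], strict=False)] = "state built from tenant A's data"

# ... and tenant B, also arriving with an unset id, hits the same entry.
leaked = cache.get(make_key(None, ["summarize", "this"], strict=False))

# Correctly scoped keys for the same prefix do not collide.
key_a = make_key("tenant-a", ["summarize", "this"])
key_b = make_key("tenant-b", ["summarize", "this"])
```

In this sketch `leaked` holds tenant A's state even though tenant B made the lookup, whereas `key_a != key_b`. Failing loudly on an unset tenant id, as the strict mode does, is one simple way to keep such a gap from going unnoticed.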

@Coldly@Just_Codly

@kimmonismus if it really crossed users how does anthropic even prove afterwards whose data went where?

21h2541

Nathan Odle@mov_axbx

@beffjezos Agree on local personal inference but there will always be inference endpoints regardless and they should be implementing this:

https://arxiv.org/abs/2603.14283

19h792