3h ago

Tim Dettmers, bitsandbytes creator, says Google DeepMind's TurboQuant is an invalid and unreplicable benchmark for KV cache compression

The dispute follows Shard claiming 10x KV cache compression.

0
Original post

Not to degrade from this work, but TurboQuant is not a competitive method nor a good benchmark. Researcher -- including me -- cannot replicate the TurboQuant paper, and even then, the performance is not great. Please. Just. Stop.

11:44 AM · May 26, 2026 View on X

@Tim_Dettmers But then how will they play sempai-notice-me games with deeeep mind?

Tim DettmersTim Dettmers@Tim_Dettmers

Not to degrade from this work, but TurboQuant is not a competitive method nor a good benchmark. Researcher -- including me -- cannot replicate the TurboQuant paper, and even then, the performance is not great. Please. Just. Stop.

6:44 PM · May 26, 2026 · 19.6K Views
7:16 PM · May 26, 2026 · 765 Views
Tim Dettmers, bitsandbytes creator, says Google DeepMind's TurboQuant is an invalid and unreplicable benchmark for KV cache compression · Digg