Tim Dettmers, bitsandbytes creator, says Google DeepMind's TurboQuant is neither a competitive method nor a good benchmark for KV cache compression, and that researchers including him cannot replicate the paper
The dispute follows Shard claiming 10x KV cache compression.
——0——
@Tim_Dettmers But then how will they play sempai-notice-me games with deeeep mind?
Not to degrade from this work, but TurboQuant is not a competitive method nor a good benchmark. Researchers -- including me -- cannot replicate the TurboQuant paper, and even then, the performance is not great. Please. Just. Stop.
6:44 PM · May 26, 2026 · 19.6K Views
7:16 PM · May 26, 2026 · 765 Views