Fuli Luo, who builds Xiaomi's MiMo LLM, details how Hybrid Sliding Window Attention and KVCache optimizations cut MiMo-V2.5 serving costs · Digg