16h ago
StateKV scales pretrained video VLMs linearly with video length at inference time, without retraining. It reduces GFLOPs while preserving accuracy on VideoMME.