12h agoVSTAT Benchmark Exposes Multimodal LLMs' Failures in Video State TrackingSentimentSentimentPos100%Neg0%Positive users express gratitude toward the collaborators behind the VSTAT benchmark for exposing multimodal LLMs' failures in video state tracking.1 comment with sentiment. View comments.