18h ago
NVIDIA releases LocateAnything, a 3B local vision-language model for UI and object grounding
A researcher critiqued the model's discrete coordinate tokens.
Sentiment: 0% positive, 100% negative (1 comment with sentiment)
Some users dismissed NVIDIA's LocateAnything 3B model as obviously worse than moondream.