NVIDIA releases LocateAnything, a 3B local vision-language model for UI and object grounding · Digg