NVIDIA releases LocateAnything-3B, a vision-language model that predicts coordinates in parallel to speed up agent localization