*NEW PAPER&MODEL* HOI-DETR for detecting hands, object in-hand, &object-interacted-with (through a tool). This is the foundation model you've been waiting for! One that works off-the-shelf, single-image, using strong detectors, generalises and is stable https://ahmaddarkhalil.github.io/HOI-DETR/ 🧵
No Digg Deeper questions have been answered for this story yet.