Furong Huang introduces DynaFLIP, a tri-modal framework that fuses vision, 3D motion, and language to improve perception for robotic manipulation.
It addresses noisy, incoherent features in encoders like DINOv2.
QUOTE POST
#465 Furong Huang @FURONGH
Thanks for sharing our paper, more details to come!
DynaFLIP: Rethinking Robotics Perception via Tri-Modal-Dynamics Guided Representation
4:23 PM · May 29, 2026 · 9.6K Views
7:35 PM · May 30, 2026 · 1.2K Views