
Fei-Fei Li Introduces StereoPolicy for Stereo Cues in Robot Manipulation

Ruohan Zhang (@RuohanZhang76)

Excited to introduce StereoPolicy, led by @EvansXuHan.

馃摲馃摲馃StereoPolicy is an effective way to add geometric cues to modern robot policy models while keeping the strengths of pretrained 2D encoders.

Why stereo for robot manipulation?

Monocular RGB often lacks the depth cues needed for precise manipulation, while RGB-D and point clouds can be noisy or brittle, especially on reflective and transparent objects in real-world deployment.

Instead of explicitly reconstructing disparity, depth, or point clouds, StereoPolicy directly fuses synchronized left/right RGB views to learn implicit stereo cues, avoiding extra reconstruction latency that can make real-time manipulation difficult.
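
To make the idea of fusing left/right views without reconstruction concrete, here is a minimal toy sketch in plain Python. It is not StereoPolicy's architecture: the post does not describe the fusion mechanism, so the patch tokenizer `encode_patches` (a stand-in for a pretrained 2D encoder) and the channel-wise concatenation in `fuse_stereo` are both hypothetical choices made for illustration. The only point is that each fused token carries both views of the same image location, so a downstream policy could learn disparity-like cues from them without an explicit depth map.

```python
from typing import List, Sequence

Image = Sequence[Sequence[float]]


def encode_patches(image: Image, patch: int) -> List[List[float]]:
    """Hypothetical stand-in for a shared 2D encoder.

    Flattens each non-overlapping patch x patch tile into one token,
    scanning tiles row by row.
    """
    height, width = len(image), len(image[0])
    tokens = []
    for row0 in range(0, height - patch + 1, patch):
        for col0 in range(0, width - patch + 1, patch):
            tokens.append([
                image[r][c]
                for r in range(row0, row0 + patch)
                for c in range(col0, col0 + patch)
            ])
    return tokens


def fuse_stereo(left_img: Image, right_img: Image, patch: int = 2) -> List[List[float]]:
    """Assumed fusion rule: concatenate left and right tokens per patch location.

    No disparity, depth, or point cloud is computed; the pairing of the two
    views is simply handed to whatever model consumes the fused tokens.
    """
    left_tokens = encode_patches(left_img, patch)
    right_tokens = encode_patches(right_img, patch)
    if len(left_tokens) != len(right_tokens):
        raise ValueError("left and right views must have the same shape")
    return [l + r for l, r in zip(left_tokens, right_tokens)]
```

Calling `fuse_stereo` on two synchronized 4x4 views with `patch=2` returns four tokens of eight values each: the first four values come from the left view and the last four from the right view at the same patch location.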

Project Page: https://stereopolicy.github.io

1:58 PM · Jun 3, 2026 · 10.6K Views