
Flat-Pack Bench Tests LVLMs On Assembly Task Reasoning

Original post
Noah Snavely #1097 @JIMANTHA (OP) | Aditya Chetan 🛫 #CVPR2026 @JUSTACHETAN

Humans can watch tasks like cooking or assembly and reason about what happened, when, and between which parts. Can LVLMs do the same? We built Flat-Pack Bench to test this – and found there is still a long way to go. Accepted at #CVPR2026! 🎥🪑🧩(1/n)

11:55 AM · May 29, 2026
