VIEWS19

Andrei Bursuc @CVPR@abursuc
On the architecture side: - introduce camera registers and do x-attention on them between images - reduce num. of heads in multi-task training - replace high-res conv layer w/ MLP + PixelShuffle Outcome: 70% training memory reduction -> “gpus don’t go boom” #cvpr2026
4hViews 19