What comes after today’s visual backbones?
At T4V @CVPR 2026, we’re bringing the community together for a focused half-day workshop on Transformers for Vision and Multimodal AI — covering image, video, 3D, MLLMs, efficient attention, SSMs/Mamba, and the next generation of visual architectures.
📍 Wed June 3 · Room 607
🕐 1:45–5:40 pm (Denver local time)
Invited speakers:
@RanjayKrishna, @thoma_gu, @sherryyangML, @jcniebles, @liuzhuang1234, and @TongPetersb.
Join us at CVPR:
https://sites.google.com/view/t4v-cvpr26/
@UNC @NVIDIAAI @NVIDIAAIDev @AIatMeta @ImagineEnpc @NJU1902 @BaskinEng
#CVPR2026 #T4V #MultimodalAI #NVIDIA