Google DeepMind's Thomas Kipf presents multimodal reference conditioning with Gemini Omni at Google I/O, crossing out Transformers on stage in favor of Graph Convolutional Networks
Technique integrates multiple data modalities for flexible model outputs.
——0——
@tkipf 😂
Gemini Omni allows me to step into an alternative timeline where Graph Convolutional Nets (GCNs) made it to the big stage 🙃 Jokes aside: excited to finally share how far we've come with multimodal reference conditioning.
6:58 PM · May 19, 2026 · 4.3K Views
8:57 PM · May 19, 2026 · 116 Views