18h ago

Google DeepMind's Gabriel Barth-Maron demonstrates Gemini Omni translating spoken audio and dynamically adjusting video scene lengths

The translation preserves background music without requiring text prompts.

0
Original post

New day now Omni findings: it can translate audio (no original or translated text given in the prompt): - it keeps the background music intact - it adjusts the edit if needed. For example the japanese and spanish sentence during the creme close-up shot is longer, so it kept that shot longer and trims that edit point…

3:06 AM · May 25, 2026 View on X

Ok this is cool

László GaálLászló Gaál@laszlogaal_

New day now Omni findings: it can translate audio (no original or translated text given in the prompt): - it keeps the background music intact - it adjusts the edit if needed. For example the japanese and spanish sentence during the creme close-up shot is longer, so it kept that shot longer and trims that edit point…

10:06 AM · May 25, 2026 · 15.2K Views
1:22 AM · May 26, 2026 · 933 Views
Google DeepMind's Gabriel Barth-Maron demonstrates Gemini Omni translating spoken audio and dynamically adjusting video scene lengths · Digg