Google DeepMind's Gabriel Barth-Maron demonstrates Gemini Omni translating spoken audio and dynamically adjusting video scene lengths
The translation preserves background music without requiring text prompts.
——0——
QUOTE POST
#1245Alex Volkov@ALTRYNE
Ok this is cool
New day now Omni findings: it can translate audio (no original or translated text given in the prompt): - it keeps the background music intact - it adjusts the edit if needed. For example the japanese and spanish sentence during the creme close-up shot is longer, so it kept that shot longer and trims that edit point…
10:06 AM · May 25, 2026 · 15.2K Views
1:22 AM · May 26, 2026 · 933 Views