7h ago

Google's Gemini Omni model generates first-person video of a taxi following a route from a marked Google Maps screenshot with consistent viewpoint and visual details

Model synthesized dynamic simulation from static map imagery.

0
Original post

I uploaded a screenshot of Google Maps to Gemini Omni with a route drawn on it. Then I prompted it to create a first person view of someone driving a taxi cab along the route in the reference image. Pretty close to the real thing.

4:58 PM · May 21, 2026 View on X

World Models ftw :)

CHRIS FIRSTCHRIS FIRST@chrisfirst

I uploaded a screenshot of Google Maps to Gemini Omni with a route drawn on it. Then I prompted it to create a first person view of someone driving a taxi cab along the route in the reference image. Pretty close to the real thing.

11:58 PM · May 21, 2026 · 28.4K Views
3:13 AM · May 22, 2026 · 3.2K Views

remarkable result

CHRIS FIRSTCHRIS FIRST@chrisfirst

I uploaded a screenshot of Google Maps to Gemini Omni with a route drawn on it. Then I prompted it to create a first person view of someone driving a taxi cab along the route in the reference image. Pretty close to the real thing.

11:58 PM · May 21, 2026 · 28.4K Views
4:21 AM · May 22, 2026 · 3.9K Views

ok im trying this tomorrow

CHRIS FIRSTCHRIS FIRST@chrisfirst

I uploaded a screenshot of Google Maps to Gemini Omni with a route drawn on it. Then I prompted it to create a first person view of someone driving a taxi cab along the route in the reference image. Pretty close to the real thing.

11:58 PM · May 21, 2026 · 28.4K Views
3:54 AM · May 22, 2026 · 1.5K Views

YouTube + Maps is one helluva data moat for Google. Case in point:

CHRIS FIRSTCHRIS FIRST@chrisfirst

I uploaded a screenshot of Google Maps to Gemini Omni with a route drawn on it. Then I prompted it to create a first person view of someone driving a taxi cab along the route in the reference image. Pretty close to the real thing.

11:58 PM · May 21, 2026 · 28.4K Views
6:27 AM · May 22, 2026 · 1.1K Views