Google DeepMind releases Gemini Omni, a model that combines Gemini reasoning systems with generative media tools to produce and edit video from varied inputs with agentic behavior and Workspace integration

Original post

Google I/O is tomorrow, last chance to get predictions in. I love to guess, so here's mine:

The Google team is being strangely quiet about the new Gemini. At this point everyone knows it is arriving tomorrow, along with their personal agent named Spark. This reticence, of course, can be interpreted in many ways. I'm choosing to interpret it in accordance with my nature.

I think they trained the largest model they've ever successfully trained - possibly the largest one anyone ever has. And something unexpected emerged at scale. They had their Mythos moment, but not in the same way Anthropic did. Gemini has always been a very different model from Claude.

The benchmarks will go out tonight under embargo (they probably already are), but I don't think they will fully reflect what I'm talking about. I think they hit something they weren't even aiming for. Something that surprised them. If I'm right, that surprise will be part of tomorrow's show. We shall find out together in the morning.

3:52 PM · May 18, 2026 · 155.2K Views


Google@Google

Introducing Gemini Spark ✨

It’s your 24/7 personal AI agent that helps you navigate your digital life, taking action on your behalf, and under your direction.

🧠 It runs on Gemini 3.5 and is built on @Antigravity, so it can perform long-running tasks easily in the background.

⏱️ And because it runs on dedicated virtual machines on Google Cloud, you don’t even need to keep your laptop open.

🧰 Spark will integrate seamlessly with Google tools, and soon with third parties through MCP.

#GoogleIO

Google DeepMind@GoogleDeepMind

We’re dropping Gemini Omni: our first step towards a model that can create anything from anything - starting with video.

It combines Gemini’s intelligence with our generative media systems - representing a leap forward in world understanding, multimodality, and editing 🧵

Google@Google

The rumors are true…

Today, we’re introducing the Gemini 3.5 model series.

#GoogleIO

Marques Brownlee@MKBHD

It is getting genuinely difficult to keep track of all of the names of AI products being unveiled. In the last hour, Google's unveiled Google Pics (which is not Google Photos), and updates to Google Flow, Nano Banana, Veo (all media generation), Google Antigravity, Gemini Spark, Gemini Omni, Gemini 3.5 Flash

Demis Hassabis@demishassabis

Gemini Omni is a major leap in world understanding & multimodal editing! It can take photos, video & audio and build entirely new scenes. Over time it’ll be able to handle any input & any output - starting w/ video

You can even give it your own videos & iterate on your ideas:

Logan Kilpatrick@OfficialLoganK

Introducing Gemini Omni 🔮........ Omni is our new model that can create anything from any input — starting with video (think Nano Banana but for video). Available in the Gemini App, Flow, and YouTube, with API support coming soon!

Sundar Pichai@sundarpichai

Gemini Omni doesn't just build scenes that look real, it reasons about what should happen next. It combines an intuitive understanding of physics with Gemini's knowledge of history, science, and cultural context.

Rolling out today starting with video outputs to Google AI Plus, Pro and Ultra subscribers globally through the @Geminiapp + Google Flow, and @YouTube Shorts this week.

Google Gemini@GeminiApp

Gemini Omni is here, and we’ve been seeing amazing creations all week. Here are some standouts 👇

Google AI@GoogleAI

Some fun Gemini Omni use cases from the community👇🧵

(We’ll keep updating this thread throughout the day)

Pushmeet Kohli@pushmeet

The results of the research happening in my team @GoogleDeepMind have convinced me that the next era of scientific discovery will be aided by AI agents acting as force multipliers for human ingenuity.

That’s why I’m proud to introduce Gemini for Science - a collection of experimental science tools designed to support researchers at every stage of the research process. The tools include:

1️⃣ Literature Insights, built with Google NotebookLM, searches millions of scientific papers to synthesize findings and generate artifacts including data tables, slides, reports, and more.

2️⃣ Hypothesis Generation, built with Co-Scientist, simulates the scientific method via a multi-agent "idea tournament" to generate, debate, and rigorously evaluate research hypotheses.

3️⃣ Computational Discovery, built with AlphaEvolve and ERA, is an agentic engine that generates and scores thousands of code variations in parallel, allowing researchers to test modeling approaches in fields like epidemiology in a fraction of the usual time.

Read more: https://blog.google/innovation-and-ai/technology/research/gemini-for-science-io-2026/

Google DeepMind@GoogleDeepMind

We want to help scientists discover their next breakthrough with AI.

Gemini for Science is our new suite of experimental tools to help them explore more hypotheses, validate work at scale, unpack literature with ease, and more 🧵

Logan Kilpatrick@OfficialLoganK

Say hello to Gemini Spark, your dedicated agent through the @GeminiApp! It runs on a dedicated virtual machine, can be fully connected to all of your Google info, and is paired with an awesome new UI in the mobile and web app, it looks and feels awesome!

fofr@fofrAI

Editing videos is where Gemini Omni Flash really shines. It is so incredibly capable.

> Make it New Year's Eve with fireworks. Update the clock

London launched the fireworks early.

fofr@fofrAI

Gemini Omni Flash:

> a recording from a capsule on the london eye, a jerky zoom into something in the distance and then refocusing (with a bit of back and forth) (no timestamp or dialog)

Note the world knowledge of London’s landscape, and the way the video is gently moving like the capsules do.

Josh Woodward@joshwoodward

Gemini Omni is so fun - insanely great at editing videos!

Daniel Sinclair@_DanielSinclair

Sir Nobel Laureate Hassabis, the new undifferentiated slop product is ready for you to post.

Born to discover the origins of life and tame nature & reality — but forced to harvest some image stills for a world model and peddle pre-rolls for YouTube Shorts. Do you think it hurts

Demis Hassabis@demishassabis

You can even give it your own videos & iterate on your ideas:

