Google's Gemini 3.5 Flash appears in Cloud Console quota interface under gemini-3.5-flash identifier at $1.5 per million input tokens and $9 per million output tokens
Quota limits include up to 400 million tokens and multiple unlimited categories.
1/ Today at #GoogleIO, we’re releasing Gemini 3.5, our latest family of models combining frontier intelligence with action.
We’re starting by releasing 3.5 Flash, which is built to help you execute complex, long-horizon agentic workflows.
Gemini 3.5 Flash is our strongest model for coding and agent http://yet.It outscores 3.1 Pro on agentic and coding benchmarks like Terminal-Bench and MCP Atlas, while running 4x faster than other frontier models.
Used in Google Antigravity, 3.5 Flash is even further optimized to be up to 12x faster. It’s a powerful engine to deploy sub-agents that collaborate, run high-frequency iterative loops, and solve real-world problems at scale.
Some highlights we’re excited about 🔽

2/ Check out how Gemini 3.5 Flash instantly digests dense academic papers and autonomously codes a fully interactive, visual website explaining the intricacies of the research. It's an incredible stress test that seamlessly merges massive long context, deep reasoning, complex coding, and ultra-low latency.
It really helps you distill papers down to their essence and aid your understanding!
1/ Today at #GoogleIO, we’re releasing Gemini 3.5, our latest family of models combining frontier intelligence with action. We’re starting by releasing 3.5 Flash, which is built to help you execute complex, long-horizon agentic workflows. Gemini 3.5 Flash is our strongest model for coding and agent http://yet.It outscores 3.1 Pro on agentic and coding benchmarks like Terminal-Bench and MCP Atlas, while running 4x faster than other frontier models. Used in Google Antigravity, 3.5 Flash is even further optimized to be up to 12x faster. It’s a powerful engine to deploy sub-agents that collaborate, run high-frequency iterative loops, and solve real-world problems at scale. Some highlights we’re excited about 🔽
3/ Gemini 3.5 Flash is rolling out globally today. On behalf of the entire Gemini team, we're excited by what you'll be able to do with this model!
Read more here: https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-5/
2/ Check out how Gemini 3.5 Flash instantly digests dense academic papers and autonomously codes a fully interactive, visual website explaining the intricacies of the research. It's an incredible stress test that seamlessly merges massive long context, deep reasoning, complex coding, and ultra-low latency. It really helps you distill papers down to their essence and aid your understanding!
1/ Today at Google I/O, we’re launching Gemini 3.5 Flash ⚡️⚡️⚡️!
Our mission was clear: bring frontier-level intelligence with unprecedented speed.
3.5 Flash delivers drastic intelligence (beating 3.1 Pro on almost every benchmark), at Flash speeds. 🧵

2/ What excites me most is 3.5 Flash’s breakthrough performance on complex, multi-step agentic tasks: it excels on Terminal-Bench 2.1 (76.2%), GDPval-AA (1656 Elo) and MCP Atlas (83.6%)
3.5 Flash is built to be the ultra-fast reasoning engine powering your AI agents. The better the model, the better the agent you build on top of it.

1/ Today at Google I/O, we’re launching Gemini 3.5 Flash ⚡️⚡️⚡️! Our mission was clear: bring frontier-level intelligence with unprecedented speed. 3.5 Flash delivers drastic intelligence (beating 3.1 Pro on almost every benchmark), at Flash speeds. 🧵
4/ Gemini 3.5 Flash is available today globally - we can't wait to see the incredible agents and apps you all build with it!
Check out our blog for more. https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-5/
3/ If you push Gemini 3.5 Flash with complex agents and tasks, it will shine brighter. Here, it generates fully tactile, interactive HTML/SVG hardware simulations, complete with bump-mapped metals, spring physics, and procedural audio, in a single shot. This is what happens when you connect reasoning across the full stack simultaneously at extremely low latency. 💡🔉
3/ If you push Gemini 3.5 Flash with complex agents and tasks, it will shine brighter. Here, it generates fully tactile, interactive HTML/SVG hardware simulations, complete with bump-mapped metals, spring physics, and procedural audio, in a single shot. This is what happens when you connect reasoning across the full stack simultaneously at extremely low latency. 💡🔉
2/ What excites me most is 3.5 Flash’s breakthrough performance on complex, multi-step agentic tasks: it excels on Terminal-Bench 2.1 (76.2%), GDPval-AA (1656 Elo) and MCP Atlas (83.6%) 3.5 Flash is built to be the ultra-fast reasoning engine powering your AI agents. The better the model, the better the agent you build on top of it.
excited by gemini 3.5 flash but also sad that it's "pushing the frontier of ... cost" not cost effectiveness 😭
Welcome to Gemini 3.5 Flash, our most powerful model to date. It pushes the frontier of intelligence, speed, and cost putting 3.5 Flash in a class of its own. We spent the last 6 months making sure Flash is great for real world use cases. It's available everywhere now!
Welcome to Gemini 3.5 Flash, our most powerful model to date. It pushes the frontier of intelligence, speed, and cost putting 3.5 Flash in a class of its own.
We spent the last 6 months making sure Flash is great for real world use cases. It's available everywhere now!

Try it in the Gemini API, Google AI Studio, Antigravity, AI Mode, Gemini App, and wherever else you use Gemini!
Welcome to Gemini 3.5 Flash, our most powerful model to date. It pushes the frontier of intelligence, speed, and cost putting 3.5 Flash in a class of its own. We spent the last 6 months making sure Flash is great for real world use cases. It's available everywhere now!
Just off stage at #GoogleIO, some highlights from this morning 🧵
Gemini 3.5 Flash is available today for everyone in @antigravity and across our products and APIs.
Compared to 3.1 Pro, 3.5 Flash is better across almost all benchmarks with huge progress in coding. It’s also comparable to the best models but very fast (4x faster tokens/ second than other frontier models). And when looking at the intelligence versus output speed, it’s in a league of its own in the top right quadrant.

Google Antigravity is expanding, including a new standalone desktop app that acts as a central home for agent interaction. We’re also introducing a new Antigravity CLI providing a fast, lightweight way to deploy new agents instantly without a graphical user interface, and a new Antigravity SDK, to give direct access to the same agent harness powering our own products to customize and host agents on your own infrastructure.
With 3.5 Flash in Antigravity, developers can now do so much more. The new Antigravity ecosystem is coming to developers today.
Just off stage at #GoogleIO, some highlights from this morning 🧵 Gemini 3.5 Flash is available today for everyone in @antigravity and across our products and APIs. Compared to 3.1 Pro, 3.5 Flash is better across almost all benchmarks with huge progress in coding. It’s also comparable to the best models but very fast (4x faster tokens/ second than other frontier models). And when looking at the intelligence versus output speed, it’s in a league of its own in the top right quadrant.
Gemini Spark is your personal AI agent in the @GeminiApp that gets things done on your behalf, under your direction. It runs 24/7 (and yes - you can close your laptop). It’s powered by Gemini 3.5 and built on the Google Antigravity harness so it can complete long horizon tasks.
Spark will integrate seamlessly with tools, starting with ours, and soon with 3P tools with MCP. You’ll also be able to work with it through email + chat. Available to trusted testers this week and next week in Beta to AI Ultra users in the US.

Google Antigravity is expanding, including a new standalone desktop app that acts as a central home for agent interaction. We’re also introducing a new Antigravity CLI providing a fast, lightweight way to deploy new agents instantly without a graphical user interface, and a new Antigravity SDK, to give direct access to the same agent harness powering our own products to customize and host agents on your own infrastructure. With 3.5 Flash in Antigravity, developers can now do so much more. The new Antigravity ecosystem is coming to developers today.
Gemini Omni is our new model that can create anything from any input - starting with video. It combines Gemini’s intelligence with our generative media models, for a new level of world understanding, multimodality, and editing.
Gemini Omni Flash is rolling out today to Google AI Plus, Pro and Ultra subscribers globally through the @Geminiapp and Google Flow, to @YouTube Shorts this week, and to our developer and enterprise APIs in the coming weeks.
Gemini Spark is your personal AI agent in the @GeminiApp that gets things done on your behalf, under your direction. It runs 24/7 (and yes - you can close your laptop). It’s powered by Gemini 3.5 and built on the Google Antigravity harness so it can complete long horizon tasks. Spark will integrate seamlessly with tools, starting with ours, and soon with 3P tools with MCP. You’ll also be able to work with it through email + chat. Available to trusted testers this week and next week in Beta to AI Ultra users in the US.
As models get even better, the need for transparency grows. Last year @nvidia adopted our SynthID invisible watermark, and today we’re excited to announce @OpenAI, Kakao, and @ElevenLabs will join them.
We’re also going further by adding C2PA Content Credentials verification to @GeminiApp alongside Synth ID detection and bringing both to Search and Chrome so you can easily check whether content was captured by a camera, or created/ edited with gen AI tools.

Gemini Omni is our new model that can create anything from any input - starting with video. It combines Gemini’s intelligence with our generative media models, for a new level of world understanding, multimodality, and editing. Gemini Omni Flash is rolling out today to Google AI Plus, Pro and Ultra subscribers globally through the @Geminiapp and Google Flow, to @YouTube Shorts this week, and to our developer and enterprise APIs in the coming weeks.
Gemini 3.5 Flash is transforming what you can do in Google Search with new agentic capabilities. A few things we’re introducing:
A new intelligent AI-powered Search box, our biggest upgrade in 25 years — rolling out globally.
New information agents that work in the background 24/7 to find exactly what you need at the right moment, and help you take action (coming this summer).
And with the power of Google Antigravity, we’re unlocking new agentic coding capabilities, so Search can build custom interactive experiences – like visual simulations, dashboards or trackers – for your individual questions.
Read more: https://blog.google/products-and-platforms/products/search/search-io-2026
As models get even better, the need for transparency grows. Last year @nvidia adopted our SynthID invisible watermark, and today we’re excited to announce @OpenAI, Kakao, and @ElevenLabs will join them. We’re also going further by adding C2PA Content Credentials verification to @GeminiApp alongside Synth ID detection and bringing both to Search and Chrome so you can easily check whether content was captured by a camera, or created/ edited with gen AI tools.
We’re having much more natural conversations with Gemini directly inside many of our products, so we’re bringing this to two more:
Ask YouTube is a new experience for searching for content on YouTube. It gives you information in an easy to navigate layout, with videos best matched to what you’re looking for, and jumps right to the part most relevant to your query.
With voice-powered Docs Live you can brain dump whatever is on your mind, and let Gemini do the rest. Rolling out this summer, and the same voice capability is coming to Gmail and Keep then too.
Gemini 3.5 Flash is transforming what you can do in Google Search with new agentic capabilities. A few things we’re introducing: A new intelligent AI-powered Search box, our biggest upgrade in 25 years — rolling out globally. New information agents that work in the background 24/7 to find exactly what you need at the right moment, and help you take action (coming this summer). And with the power of Google Antigravity, we’re unlocking new agentic coding capabilities, so Search can build custom interactive experiences – like visual simulations, dashboards or trackers – for your individual questions. Read more: https://blog.google/products-and-platforms/products/search/search-io-2026
Also had some early access to Gemini 3.5 Flash. Very fast for a flash model and very capable, though not as powerful as a full frontier model.
I added it to the gallery or procedurally generated one-shot towns (it made one error that it corrected): https://hg-20f7d1a3ce.netlify.app/#gemini-3-5-flash

The metrics on 3.5 Flash are major: 4x faster than other frontier models in output tokens per second Outperforms our previous 3.1 Pro model on nearly all benchmarks Shows massive improvement on coding and agentic benchmarks like Terminal-Bench 2.1 (76.2%), GDPval-AA (1656 Elo) and MCP Atlas (83.6%)

Live from #GoogleIO, we’re introducing Gemini 3.5 Flash, our latest model with frontier performance for agents and coding. We’re rolling it out globally today, delivering frontier-level performance for real-world agentic workflows at 4x the speed of other frontier models. 🧵
Live from #GoogleIO, we’re introducing Gemini 3.5 Flash, our latest model with frontier performance for agents and coding. We’re rolling it out globally today, delivering frontier-level performance for real-world agentic workflows at 4x the speed of other frontier models. 🧵

The combination of performance and speed makes this model ideal for long-horizon and agentic tasks. Our enterprise partners are already seeing real-world results and accelerating their daily workflows. Read more about Gemini 3.5 in our blog: https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-5/
The metrics on 3.5 Flash are major: 4x faster than other frontier models in output tokens per second Outperforms our previous 3.1 Pro model on nearly all benchmarks Shows massive improvement on coding and agentic benchmarks like Terminal-Bench 2.1 (76.2%), GDPval-AA (1656 Elo) and MCP Atlas (83.6%)
Today at Google I/O, we introduced Gemini 3.5 Flash! It has become an integral part of our daily research cycle and works with all the tools we have at Google. We used a team of agents in Antigravity 2.0 to recreate the original AlphaZero research paper and build a playable version. They coded the reinforcement learning pipeline in JAX/Flax, trained a ResNet model from scratch via self-play on multi-TPU pods, and shipped a full-stack web app so you can play against it, from just 2 prompts. . Here’s what else makes 3.5 Flash special 🧵
Gemini 3.5 Flash delivers sustained frontier-level performance at lightning-quick speeds: Beats 3.1 Pro on coding & agentic benchmarks Terminal-Bench 2.1 (76.2%), GDPval-AA (1656 Elo) and MCP Atlas (83.6%) 4x faster than other frontier models (12x in Antigravity!) SOTA on multimodality with 83.6% on MMMU-Pro

Today at Google I/O, we introduced Gemini 3.5 Flash! It has become an integral part of our daily research cycle and works with all the tools we have at Google. We used a team of agents in Antigravity 2.0 to recreate the original AlphaZero research paper and build a playable version. They coded the reinforcement learning pipeline in JAX/Flax, trained a ResNet model from scratch via self-play on multi-TPU pods, and shipped a full-stack web app so you can play against it, from just 2 prompts. . Here’s what else makes 3.5 Flash special 🧵
Gemini 3.5 Flash is rolling out globally for consumers in the Gemini app and Search AI Mode, for developers via the Gemini API, Google Antigravity, and Google AI Studio, and for businesses on the Gemini Enterprise Agent Platform. Read more about the Gemini 3.5 era: https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-5/
Gemini 3.5 Flash delivers sustained frontier-level performance at lightning-quick speeds: Beats 3.1 Pro on coding & agentic benchmarks Terminal-Bench 2.1 (76.2%), GDPval-AA (1656 Elo) and MCP Atlas (83.6%) 4x faster than other frontier models (12x in Antigravity!) SOTA on multimodality with 83.6% on MMMU-Pro
Kind of a big deal
Meet Gemini 3.5 Flash — our strongest agentic and coding model yet. It delivers frontier-level performance at 4x the speed of comparable frontier models — often at less than half the cost. Generally available, starting today. 🧵 #GoogleIO
Holy shit man
Gemini 3.5 Flash is built to help you execute complex, agentic workflows. 3.5 Flash rivals flagship models to deliver frontier performance for agents and coding, at the lightning speeds you expect from the Flash series.
@scaling01 I hope so.
Gemini 3.5 Pro is the Gemini Ultra we always wanted
Looking forward to gemini-cli becoming usable
Gemini 3.5 Flash is built to help you execute complex, agentic workflows. 3.5 Flash rivals flagship models to deliver frontier performance for agents and coding, at the lightning speeds you expect from the Flash series.
Today, we introduced Gemini 3.5 Flash ⚡ Our most capable coding and agentic model — where "fast" and "best" aren't a tradeoff. Try it now across Antigravity, AI Studio, Gemini App, and AI Mode.
Welcome to Gemini 3.5 Flash, our most powerful model to date. It pushes the frontier of intelligence, speed, and cost putting 3.5 Flash in a class of its own. We spent the last 6 months making sure Flash is great for real world use cases. It's available everywhere now!
I’m extremely proud of the team and this has been one of most intense and most rewarding launches we have done! And we are not done yet and are busy cooking 3.5 Pro.
Today, we introduced Gemini 3.5 Flash ⚡ Our most capable coding and agentic model — where "fast" and "best" aren't a tradeoff. Try it now across Antigravity, AI Studio, Gemini App, and AI Mode.
Here is a fun demo showcasing the model’s Web Development capabilities.
I’m extremely proud of the team and this has been one of most intense and most rewarding launches we have done! And we are not done yet and are busy cooking 3.5 Pro.
Gemini 3.5 Flash is now GA. Our most capable Flash model, built for agentic execution, coding, and long-horizon tasks.
- Outperforms Gemini 3.1 Pro on coding and agentic tasks - 1M token context window with 65k max output tokens - 4x faster output tokens/sec - 4 thinking levels: minimal, low, medium (new default), high - Thought preservation across multi-turn conversations automatically
Available today in @GoogleAIStudio, @Android Studio, @antigravity, Gemini Enterprise, the @GeminiApp, and AI Mode in Search.

Developer Guide: https://ai.google.dev/gemini-api/docs/interactions/whats-new-gemini-3.5 Introducing Gemini 3.5: https://blog.google/innovation-and-ai/models-and-research/gemini-models/gemini-3-5/ AI Studio: https://aistudio.google.com/prompts/new_chat?model=gemini-3.5-flash
Gemini 3.5 Flash is now GA. Our most capable Flash model, built for agentic execution, coding, and long-horizon tasks. - Outperforms Gemini 3.1 Pro on coding and agentic tasks - 1M token context window with 65k max output tokens - 4x faster output tokens/sec - 4 thinking levels: minimal, low, medium (new default), high - Thought preservation across multi-turn conversations automatically Available today in @GoogleAIStudio, @Android Studio, @antigravity, Gemini Enterprise, the @GeminiApp, and AI Mode in Search.
@OfficialLoganK @GoogleDeepMind Good model
Welcome to Gemini 3.5 Flash, our most powerful model to date. It pushes the frontier of intelligence, speed, and cost putting 3.5 Flash in a class of its own. We spent the last 6 months making sure Flash is great for real world use cases. It's available everywhere now!
We wrote a new Developer Guide for Gemini 3.5 Models. Easiest way to migrate is install the Interactions API Skill. Then run
``` /gemini-interactions-api migrate my app to Gemini 3.5 Flash ```
Skill available at: `npx skills add google-gemini/gemini-skills --skill gemini-interactions-api`
it now seems very likely to me that the new Gemini 3.5 Pro model will be a 10T+ model like Mythos
it's Gemini 3.5 Flash day but pricing is $1.5 / $9 per mtoks 💀
pricing of 3.5 Pro should be $6 / $27 if they keep the 4x scaling and the trend of undercutting OpenAI
it now seems very likely to me that the new Gemini 3.5 Pro model will be a 10T+ model like Mythos
Gemini 3.5 Pro is the Gemini Ultra we always wanted
it now seems very likely to me that the new Gemini 3.5 Pro model will be a 10T+ model like Mythos
Google introduces Gemini 3.5 Flash

intelligence too cheap to meter

Google built an entire operating system with Gemini 3.5 Flash in 12 hours for less than $1000

Strong performance across the board for Gemini, but holy crap GPT-5.5 is goated at long context WTF
Gemini 3.5 Flash is built to help you execute complex, agentic workflows. 3.5 Flash rivals flagship models to deliver frontier performance for agents and coding, at the lightning speeds you expect from the Flash series.
Narrator: they already fucked up
→ Gemini 3.5 Flash not available on API.
→ Fast mode locked to Antigravity only.
I don't understand why companies keep doing this.
They invent a portal gun, only to lock it behind a taxi subscription, because they completely fail to realize their very product deprecates that other thing they think will make them money?
Cursor is a great example of a company that (sadly) is very likely fail because of that mindset. Composer is actually surprisingly good model. They should put all efforts in serving it. Yet, they keep locking under a old school product that nobody wants to use.
It is 2026. NOBODY should be using IDEs anymore. Get over it. Let it GO. I'm certainly not launching a VSCode fork to use a model, no matter how great it is.
Your model is the product.
You do NOT need an IDE to make money.
You keep chasing old business models.
Completely out of touch.
Meanwhile Anthropic is all charging at full speed to sooner or later surpass Google by just serving great models under an API!!!
It is not hard. WHY it has to be so hard
. . .
The new Gemini 3.5 Flash solved the HVM3's wnf bug in 1/3 attempts. This is my main test to take a model seriously. So far only the big models like GPT 5.5 solved it. And seems like it is 20x faster than Opus 4.6 ! Promising but Google will still find a way to fuck up
Narrator: they already fucked up
→ Gemini 3.5 Flash not available on API.
→ Fast mode locked to Antigravity only.
I don't understand why companies keep doing this.
They invent a portal gun, only to lock it behind a taxi subscription, because they completely fail to realize their very product deprecates that other thing they think will make them money?
Cursor is a great example of a company that (sadly) is very likely fail because of that mindset. Composer is actually surprisingly good model. They should put all efforts in serving it. Yet, they keep locking it under an old school product that nobody wants to use.
It is 2026. NOBODY should be using IDEs anymore. Get over it. Let it GO. I'm certainly not launching a VSCode fork to use a model, no matter how great it is.
And even these who DO use IDEs probably won't necessarily pick YOUR IDE. And they shouldn't. You do NOT need them to, to make money.
Your model is the product.
You keep chasing old business models.
Completely out of touch.
Meanwhile Anthropic is all charging at full speed to sooner or later surpass Google by just serving great models under an API!!!
It is not hard. WHY it has to be so hard
. . .
The new Gemini 3.5 Flash solved the HVM3's wnf bug in 1/3 attempts. This is my main test to take a model seriously. So far only the big models like GPT 5.5 solved it. And seems like it is 20x faster than Opus 4.6 ! Promising but Google will still find a way to fuck up
@melvinjohnsonp ⚡ woohoo! incredible work! congratulations! ⚡
Today, we introduced Gemini 3.5 Flash ⚡ Our most capable coding and agentic model — where "fast" and "best" aren't a tradeoff. Try it now across Antigravity, AI Studio, Gemini App, and AI Mode.
4/ Gemini Omni Flash is the other monster announcement.
Google’s framing: “create anything from any input — starting with video.”
Text, images, video, audio as inputs → high-quality generated/edited video grounded in Gemini’s world knowledge.
3/ The positioning is clear: Flash is no longer just “cheap fast model.” Google wants Gemini 3.5 Flash to be the default engine for long-horizon agents: plan, build, iterate, use tools, execute code, complete real work. Gemini 3.5 Pro comes next month. Can't wait to try Flash
Look at them digits
Welcome to Gemini 3.5 Flash, our most powerful model to date. It pushes the frontier of intelligence, speed, and cost putting 3.5 Flash in a class of its own. We spent the last 6 months making sure Flash is great for real world use cases. It's available everywhere now!
Insane evals for a Flash model! Gemini 3.5 Flash is really good for its size!

Gemini 3.5 Flash official! Insanely fast an capable model
Gemini 3.5 Flash official! Insanely fast an capable model
„Progress towards AGI“: Gemini Omni - world models -Gemini Omni official!! It can create anything from any input!!!
Gemini 3.5 pro next month!!!

Gemini 3.5 Flash official! Insanely fast an capable model
OH WOW! GOOGLE FINALLY BECOMES A LEGIT AI COMPANY Gemini 3.5 Flash is Generally Available!!
No more preview launches with rate limits!!!
This is earth shattering....😲😲
OH WOW! GOOGLE FINALLY BECOMES A LEGIT AI COMPANY Gemini 3.5 Flash is Generally Available!!
No more preview launches with rate limits!!!
This is earth shattering.... It's like they really have an engineering team 😲😲
Gemini 3.5 Flash is here!!! 🚀🚀
Priced at 3x it's predecessor but still WAY CHEAPER than GPT 5.5 or Opus 4.7
We are evaluating the model against a bunch of real-world quality evals. Results coming later today
coding with flash is a different experience, it's absurdly fast, it sometimes feels instant. for hard debugging tasks it can explore large areas of the problem space in minutes. it can outperform bigger models on hard tasks by crunching more tokens in less clock time
Welcome to Gemini 3.5 Flash, our most powerful model to date. It pushes the frontier of intelligence, speed, and cost putting 3.5 Flash in a class of its own. We spent the last 6 months making sure Flash is great for real world use cases. It's available everywhere now!
@yacineMTB The main thing I was looking forward to
Kind of a big deal





