Google plans to unveil its largest Gemini model yet and a personal agent named Spark at I/O tomorrow
Coverage also flags expected gains in modality fusion and a possible price increase.
perhaps Google is shocked they've finally trained a non-mentally-ill Gemini that accepts the passage of time after 2024. Seriously though I don't hope for that. We will get some unprecedented modality fusion, knowledge accuracy, and RL-d ability (top 1 CodeForces etc). Price hike
Google I/O is tomorrow, last chance to get predictions in. I love to guess, so here's mine: The Google team is being strangely quiet about the new Gemini. At this point everyone knows it is arriving tomorrow, along with their personal agent named Spark. This reticence, of course, can be interpreted in many ways. I'm choosing to interpret it in accordance with my nature. I think they trained the largest model they've ever successfully trained - possibly the largest one anyone ever has. And something unexpected emerged at scale. They had their Mythos moment, but not in the same way Anthropic did. Gemini has always been a very different model from Claude. The benchmarks will go out tonight under embargo (they probably already are), but I don't think they will fully reflect what I'm talking about. I think they hit something they weren't even aiming for. Something that surprised them. If I'm right, that surprise will be part of tomorrow's show. We shall find out together in the morning.
Google I/O is tomorrow, last chance to get predictions in. I love to guess, so here's mine:
The Google team is being strangely quiet about the new Gemini. At this point everyone knows it is arriving tomorrow, along with their personal agent named Spark. This reticence, of course, can be interpreted in many ways. I'm choosing to interpret it in accordance with my nature.
I think they trained the largest model they've ever successfully trained - possibly the largest one anyone ever has. And something unexpected emerged at scale. They had their Mythos moment, but not in the same way Anthropic did. Gemini has always been a very different model from Claude.
The benchmarks will go out tonight under embargo (they probably already are), but I don't think they will fully reflect what I'm talking about. I think they hit something they weren't even aiming for. Something that surprised them. If I'm right, that surprise will be part of tomorrow's show. We shall find out together in the morning.
Here we go:
Gemini
On our way to I/O 2026. See you at 10am PT tomorrow!
@teortaxesTex SOTA LM ARENA
perhaps Google is shocked they've finally trained a non-mentally-ill Gemini that accepts the passage of time after 2024. Seriously though I don't hope for that. We will get some unprecedented modality fusion, knowledge accuracy, and RL-d ability (top 1 CodeForces etc). Price hike
There is no doubt Gemini will max benchmarks - they always do
The only question is whether it’s actually a good model or will it simply be a benchmaxxed model