Google is held to an unreasonably high standard. Flash would ordinarily be 10% of the cost of GPT-5.5. We're not in the age where only Google hill-climbs hard math. Even fucking Anthropic ships models that beat… DeepSeek. Everyone is serious now.
If this is Gemini 3.2 Flash, it's strictly worse than GPT-5.5 at math. It fails MathArena Apex #11 and #12/IMO P6.

