The frontier orchestrator gap is real and the observation is correct. But domain-specific performance is a different story. Just published research today with a legal expert who co-wrote Slovenian legislation. We specifically tested Gemini 3.1 Pro on complex legal reasoning tasks. The results were striking enough that we called legal services the second biggest AI disruption currently underway, right behind coding. The "law is code" insight is literal, not a metaphor. Legal reasoning is variables, conditionals, dependencies, edge cases. Exactly what these models are optimized for. Flash for speed, Pro for reasoning depth on a specific domain. The absence of a general frontier model doesn't mean the model can't do frontier work in the right context. The question is whether Google understands how to position it or whether they're still chasing the general benchmark.