This is exactly right.
People are starting to look for cheaper model alternatives and realizing two things at once: open-source models are already very good, and the ability to train and serve them efficiently at scale can change the economics pretty meaningfully.
Tokens are still being subsidized, demand is ramping quickly, and the compute crunch is likely to persist. That will push companies toward using the right model for each task instead of defaulting to the most expensive one.
We’re still early, but I expect open-weight adoption to accelerate much faster than most people think.















