Maybe we are moving to the "optimization" phase of the AI game. Lots of things change in this phase. In the dot-com boom, all website were built on Oracle+Sun. Five years later all MySQL + Linux. Eventually the margins matter.
In addition to enterprises pushing to better understand token costs, I think the consumer models (I use many) are "trying less hard" recently. Meaning, trying to do answer without web search. Angling to give up sooner. I have to believe this is a result of cost optimization.













