Claude Opus 4.8 Max takes first on AutomationBench with 15.5%, but critics dispute the model hierarchy · Digg