GLM-5.2 Max reasoning claims the top spot on PostTrainBench, beating Opus 4.8 Max with 100% execution reliability · Digg