Z.ai's GLM-5.2 scores 22.8% on ARC-AGI-2, leading Chinese models but prompting debate over Western benchmark hill-climbing · Digg