GLM results are insanely high If this is "distillation", it's doing better than any previous attempt Eg ProofBench should be amenable both to distillation and RLVR, yet V4, Kimi, Mimo, Grok lol are all hopeless. Zhipu will obviously aim to fully match/exceed 5.5&4.8 with 5.3.
Full results for GLM 5.2 are here!
This open-weight model ranks #1 on the Vals Index, Harvey’s Legal Agent Benchmark, Finance Agent v2, ProofBench, and Vibe Code Bench, and places in the top four open-weight models across all of our in-house benchmarks.

