
GLM 5.2 Tops Vals AI Leaderboard and Multiple Agent Benchmarks

Original post

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) @teortaxesTex · #501 in Tech

GLM results are insanely high. If this is "distillation", it's doing better than any previous attempt. Eg ProofBench should be amenable both to distillation and RLVR, yet V4, Kimi, Mimo, Grok lol are all hopeless. Zhipu will obviously aim to fully match/exceed 5.5 & 4.8 with 5.3.

Vals AI @ValsAI

Full results for GLM 5.2 are here!

This open-weight model ranks #1 on the Vals Index, Harvey’s Legal Agent Benchmark, Finance Agent v2, ProofBench, and Vibe Code Bench, and places in the top four open-weight models across all of our in-house benchmarks.

8:55 PM · Jun 18, 2026 · 6.1K Views

Sentiment

Users in the replies defend GLM 5.2's benchmark leadership as genuine hard work rather than distillation, while others sarcastically dismiss the results as mere version-number parity with OpenAI.

Pos: 50.0%
Neg: 50.0%

2 comments with sentiment.

Cluster Engagement