/Tech6h ago

GLM-5.1 Trails Leading Models on Private WeirdML Benchmark

213021.9K

Original post

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex#501inTech

I am waiting for @htihle's WeirdML results one of the few truly hard, fully private benchmarks that really humble Chinese models so far. Glm-5.1 is the best they could do so far. 5.1 was, I think, a lesser step-up vs 5.0 than 5.2 is to 5.1. I brazenly predict 0.725.

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

When Chinese bros have more compute, starting in H2 2026… they won't race ahead, because American bros (at least smart ones) have been investing lots of compute into experiments, so they'll have an easier time with large-scale training. This remains a close competition.

10:46 AM · Jun 17, 2026 · 1.1K Views

Sentiment

Sentiment building, check back later.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS756BOOKMARKS2LIKES5REPLIES1

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

@htihle Lower bound, I'd say, is 0.680 Upper bound is 0.785

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)@teortaxesTex

6h75652