/Tech5h ago

Open-weights model GLM-5.2 surpasses proprietary Claude Opus 4.7 and GPT 5.4 on the Runescape benchmark

The surpassed proprietary models led rankings two months ago

627361913472.2K

#756

Original post

mattparlmer 🪐 🌷@mattparlmer

We need an American lab that can make open weights models that are this good

max@maxbittker

GLM-5.2 just scored better than Opus 4.7 and GPT 5.4 on Runescape bench.

These models were best in class only 2-3 months ago.

Open source frontier is catching up!?

5:49 PM · Jun 16, 2026 · 11.3K Views

Sentiment

Many users congratulated the open-source GLM-5.2 release for topping Opus and GPT on the Runescape benchmark while others dismissed that benchmark as irrelevant to real-world use.

Pos

42.9%

Neg

57.1%

8 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS2.5KLIKES15

max@maxbittker

congrats @Zai_org on the release!

9h2.5K15

BOOKMARKS1

max@maxbittker

@thaonlyjonathan $150 at api price- Fable was cheaper than GPT-5.5 xhigh.

And for reference, GLM-5.2 was $32

3h22211

RETWEETS16

max@maxbittker

GLM-5.2 just scored better than Opus 4.7 and GPT 5.4 on Runescape bench.

These models were best in class only 2-3 months ago.

Open source frontier is catching up!?

9h61.7K539122

REPLIES1

Jia Ming (إحسان)@__jiaming__

@ChickenSamosaa @mattparlmer the reason they do it is simply cuz they're not the best. and it helps gain adoption without compromising their standing

3h15

henrique@otaldohenrikkk

@maxbittker The rule is clear: if Gemini appears on a benchmark with a high score, the benchmark is useless.

7h4488

Chris Laupama@chrislaupama

@maxbittker ah runescape, the only benchmark that matters.

6h6548

0xSammy@0xSammy

@maxbittker The open source renaissance is beginning

At a fraction of the cost

Love this benchmark

8h1.3K6

JMB 🧙‍♂️@jmbollenbacher

@mattparlmer Impossible I'm afraid. American tech culture forbids it.

Americans have the mentality that they should be billionaires if they can make something good, and so they'll never make good models free open weights.

The US will never have an open weights ecosystem the way China does.

3h574

guille@angeris

@mattparlmer 5.2 is surprisingly good at reverse engineering btw

3h1593

aech@smplrandom

@mattparlmer @DanielleFong Arcee?

5h1712

mattparlmer 🪐 🌷@mattparlmer

@angeris Reverse engineering what?

3h1022

max@maxbittker

@theAlexQuach Natural log of peak xp per minute

3h2091

mattparlmer 🪐 🌷@mattparlmer

@smplrandom @DanielleFong They have yet to drop a model anywhere near this high scoring on benchmarks

5h1751

Alex Quach@theAlexQuach

@maxbittker scores should be out of 120 or 200M imo

3h252

Dylan Fiori@dtfiori

@maxbittker RuneScape bench?!?! Why is this the first I’m hearing of this amazing bench lol

6h6233

Cullen@cullend

@mattparlmer Apple should do it/ fund it (bc brand safety) They pay Google $1 billion a year, a pittance to running a real AI lab and a bet LLMs get commodified. Timing would be right, after the lab IPOs, if a dozen or so really good engineers decide they have enough money/ take a low salary

5h158

guille@angeris

@SCH_Clay @mattparlmer software (for now!)

i don’t like stuff connecting to the internet at all times

3h331