/Tech4h ago

Developer @burkov says open-source GLM 5.2 matched OpenAI Codex performance during a three-day coding workflow test

Story Overview

A developer reports that the open-source GLM 5.2 held its own against OpenAI Codex across a full three-day coding workflow, delivering equivalent results on bug fixes and feature additions despite being freely available for local use.

1431.7K83393121.9K

#682

Original post

Andrew Curran@AndrewCurran_#682inTech

BURKOV@burkov

For the last three days, I've been using GLM 5.2 with OpenCode instead of Codex and I don't see any difference.

There wasn't any bug that GLM would fail to fix or a feature it would fail to add as requested.

The only downside is that this model cannot see, so if it's simpler to explain an issue by pasting a screenshot, I would still use Codex. Otherwise, GLM would be my choice.

Will continue to use it for two more weeks and, if it keeps just working, I will cancel my $100/month subscription with OpenAI. I already cancelled my Anthropic subscription and have no regrets.

No moat isn't hypothetical anymore.

8:01 AM · Jun 20, 2026 · 4.7K Views

Developer Impact

Where vision still forces a hybrid setup

The model handles text-only coding tasks well yet lacks any image input, so screenshot-based debugging continues to require Codex or a separate vision tool with no indication yet of when that gap might close.

Cost Pressure

Access and pricing leave room for wider testing

Open weights dropped quickly after an initial subscriber window, with usage costs reported far below closed frontier models, though the scale of any subscription cancellations remains unquantified beyond individual anecdotes.

Sentiment

Many users praise GLM 5.2 for matching Codex-level coding performance and replacing expensive subscriptions, while some note drawbacks like missing multimodality.

Pos

64.7%

Neg

35.3%

17 comments with sentiment.

Cluster Engagement

Digg Deeper

No Digg Deeper questions have been answered for this story yet.

Posts from X

Most Activity

VIEWS5.1K

David Hapunkt@nichtsalsdavid

@burkov How much more GLM-5.2 do i get out of a Ollama Cloud subscription compare to OpenAI Codex?

8h5.1K101

BOOKMARKS4LIKES20

Andrew Curran@AndrewCurran_

elvis@omarsar0

GLM-5.2 is great at design (Opus level IMO).

I am also starting to see great results with long-running tasks, too.

How is this possible?

I think there are a few clever hacks. But I just came across this from the official blog, and they actually trained this model with an anti-hacking module.

RL, as many know, comes with this issue of reward hacking that often enables the model to take weird and suboptimal shortcuts. Not only that, but it makes the models sometimes feel like it's sometimes "lazy" or just plain "dumb" at times, including other issues like intent misalignment, verbosity, sycophancy, deception, etc. And you really don't want that for long-running tasks operated by coding agents.

This is a great insight. If you use the standard /goal (in 5.5 or 4.8), you notice the models often take shortcuts that lead to long-running tasks (wasting tokens along the way) but with poor results. This is why I advocate for a focus on better verifiers.

So this anti-hacking idea is a model capability that should, in theory, lead to better results on long-horizon tasks.

I've seen efforts here and there in a few research papers, but haven't seen it translated to much, much less in a frontier, open-weight model.

This might be contributing to some of the great results we are seeing with GLM-5.2, but I suspect there is more, of course, like better verification capabilities. It's not clear how all of these training signals lead to downstream capabilities, but this is something to look at closely with newer models.

1h3.4K204

RETWEETS82

BURKOV@burkov

For the last three days, I've been using GLM 5.2 with OpenCode instead of Codex and I don't see any difference.

There wasn't any bug that GLM would fail to fix or a feature it would fail to add as requested.

The only downside is that this model cannot see, so if it's simpler to explain an issue by pasting a screenshot, I would still use Codex. Otherwise, GLM would be my choice.

Will continue to use it for two more weeks and, if it keeps just working, I will cancel my $100/month subscription with OpenAI. I already cancelled my Anthropic subscription and have no regrets.

No moat isn't hypothetical anymore.

10h117.4K1.7K398

REPLIES5

Christopher Cook@webprofusion

@burkov Yeah glm5.2 works well. I'm using it via openrouter, token cost for me is still up to $50 a day though, depending on the job.

9h3.4K191

John Ennis@johnennis

@burkov Why not just set up a little Gemini flash lite service for vision?

8h2.1K173

Slopware Engineer@aienginerd

just curious why you would prefer (without a doubt) a weaker model and on top of that, paying for tokens that aren't subsidized and even at gym 5.2 pricing you'll blow past that $100 easily if you're doing real and daily work with it. I want open source to win too, but this doesn't seem like the moment, yet to be doing all that.

6h2.8K17

TolkienWindow@AbionMorse

@webprofusion @burkov this might be worth checking out too https://ollama.com/pricing

8h29623

IGOOR.ORG@igoor_org

@burkov Inside Claude Code you can natively paste screenshots (I think it's the only one natively supporting it).

8h2.4K21

Prathmesh Pandey@file_mutex

@burkov glm5.2 is much less vfm than a $100 codex plan though.

the zai's coding subscription plan would have been a better deal but then you end up sending data to china.

7h2.4K5

jorge guerrero@jorgiting

@file_mutex @burkov as opposed to sending all your data to anthropic(the good guys)

6h29813

Mohamed yousof@mohamedyousof

@burkov Use official vision MCP https://docs.z.ai/devpack/mcp/vision-mcp-server

6h7612

Alexandre Pires@alexclpires

@nichtsalsdavid @burkov You get a lot more, but the main issue with ollama it's just the latency of it sometimes that makes it so unusable... if it was working even 95% of the time, it would be one of the best bang for your buck kinda deal.

5h28011

Nexzul@NexzulX

@burkov Is GLM really as good? I figured it would be benchmaxxed.

3h1.2K3

BowTied Biohacker@BowTiedUM

@burkov @q_yeon_gyu_kim we got another one unaware of the multimodal looker

2h3.8K11

Hassan@hassandev22

@file_mutex @burkov What will they do with your data that US can't

7h28811

Andrew@ai_ops_lead

@burkov Wonder if you could give it a tool that uses an OAI model to answer questions about images. Because at least for frontend work it's essential that the model can take screenshots as it works. That's the only way to implement UI features autonomously, IMO.

6h1.7K1

Woji Piskorz@WojciechPiskorz

@burkov they solved it with vision MCP (installed by default with zcode or with MCPs in opencode https://docs.z.ai/devpack/mcp/vision-mcp-server)

5h30021

David Peterson@MathmoKiwi

@nichtsalsdavid @burkov I wonder also how the US$20/month Ollama Cloud subscription compared against the @OpenCode Go subscription for $10/month?

8h6195

Eeshan@notesundrground

@burkov I had already switched over to GLM 5.1 for the past couple of months, and GLM 5.2 is even better. It’s liberating to not be part of the Claude / Codex dramas. I’ll be back to them once they turn back into stable model providers.

4h1.2K8

象道@jiadehui

@MoonlitMaven @burkov You may have a look at API service on http://aimoway.com Forget those subscription plans.

3h10121