Open weights models, via GLM 5.2, had their "very practically useful" in coding harness moment before Gemini. ~200 days since the release of Opus 4.5.
AI2 post-training lead Nathan Lambert says open-weights models achieved practical coding utility ahead of Google's Gemini
Story Overview
Nathan Lambert, AI2's post-training lead, highlighted that open-weights models like GLM 5.2 have reached a stage of real coding and agentic usefulness ahead of Google's Gemini, with the milestone landing roughly 200 days after Anthropic's Claude Opus 4.5 launch.
Benchmarks show open models pulling ahead on specific tasks
GLM 5.2, a 744B-parameter MoE model from Z.ai with a 1M-token context window, posted leading open-weights scores on coding arenas and agentic benchmarks, though direct head-to-head data against Gemini remains limited to those public leaderboards.
Questions linger on capability versus ecosystem reach
Observers note it is unclear whether Gemini trails in raw ability or simply faces narrower adoption for coding workflows, with no further details available on timelines or broader performance gaps.
Users are reacting to open weights models reaching practical coding usefulness before Gemini, with some praising their rapid progress while others call Gemini mediocre and express sadness over Google's lagging position.
Most Activity
@natolambert damn, that's a good point.
though I don't know if gemini really isn't there or if they have just managed to gather zero interest in adoption of their ecosystem.

@liyucheng_2 sir be realistic, it's a community consensus

@natolambert wrong, that's glm5.1, 5.2 is another leap

@natolambert I am a Google shareholder. This makes me very sad. Can Google recover, or is it too late already?

@natolambert What's the minimum GPU spec I need to run GLM 5.2 locally?

@Onwuta_Kelvin @natolambert 4x 128GB Nvidia Spark, and that only gets you an unusable tokens-per-second rate. ~$12k
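
For a rough sense of where a figure like 4x 128GB could come from, here is a back-of-the-envelope sketch of weight memory only. It assumes the 744B parameter count quoted in the summary above and a few common weight precisions; the helper names are hypothetical, and KV cache, activations, and runtime overhead are ignored, so real requirements would be higher.

```python
# Rough weights-only memory estimate. All inputs are assumptions for illustration:
# 744B parameters (figure quoted in the story summary) and generic precisions.
# KV cache, activations, and framework overhead are NOT included.

def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float,
         num_devices: int, gb_per_device: float) -> bool:
    """True if the weights alone fit in the combined memory of the devices."""
    total_gb = num_devices * gb_per_device
    return weight_memory_gb(params_billion, bits_per_weight) <= total_gb


if __name__ == "__main__":
    PARAMS_B = 744            # assumed parameter count, in billions
    DEVICES, GB_EACH = 4, 128  # the 4x 128GB setup mentioned in the reply
    for bits in (16, 8, 4):
        need = weight_memory_gb(PARAMS_B, bits)
        ok = fits(PARAMS_B, bits, DEVICES, GB_EACH)
        print(f"{bits}-bit weights: ~{need:.0f} GB, fits in {DEVICES}x{GB_EACH}GB: {ok}")
```

Under these assumptions only the 4-bit case fits in 512 GB of combined memory, which is consistent with the reply's point that such a setup is a bare minimum rather than a comfortable one. Fitting the weights says nothing about throughput.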

@natolambert What is your own experience with GLM 5.2 in a coding harness?

@natolambert kimi 2.5 was a moment as well, multi modal and fast.

@natolambert Gemini doesn't seem like a serious model at all compared to GPT-5.5. Feels like some mediocre open model to be frank.

@natolambert Open-source models have caught up really fast this round, and it's most obvious in coding.

@goldstein_aa @natolambert I’m with you. Feel your pain. But Google has deep pockets and a lot of talent.
Let’s see how the next iteration of Gemini performs. My investments also hope it can get back onto the frontier.

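@natolambert @liyucheng_2 4.5 is already very good. Considering GLM 5.2 is only 753B in size, I believe China will soon be able to build something Fable-level. That idiot 阿迪王 using Fable to scare the whole world is just anti-human.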

@natolambert it's been ~200 days of everyone pretending open weights was supposed to keep pace
the utility gap is just longer than people wanna admit

@natolambert Insane

@liyucheng_2 @natolambert The kabobs don’t lie.

@QuentinCompsci @natolambert Bruh!!