Harnesses matter way more than people think.
Cline ran a couple of experiments on a set of coding tasks using GLM 5.2:
• 57.3% using their harness with reasoning turned off. • 68.5% with their harness with reasoning turned up.
That's a difference of 11.2 percentage points! Same model, same set of problems. The difference stemmed from how the model was driven by the harness.
Current open-weight models are way more capable than we think. They aren't the bottleneck anymore.
We need better harnesses.
We’ve been impressed with GLM-5.2 and so are introducing a $9.99/month subscription to give you 2-5x discounted access to it and other open weight models like DeepSeek, Kimi, MiniMax, Mimo, Qwen.
Use it on Cline CLI & IDE with $1.99 special promo if sign up via: npm i -g cline
















