23h ago

OpenClaw outperforms Hermes Agent in Qwen 35B local model benchmark

0

Atomicbot.ai shared benchmark results comparing OpenClaw and Hermes Agent running the Qwen 35B local model on a MacBook Pro M5 Max 64GB. On May 15 2026 the agents were tasked with scraping GitHub star history data, identifying growth spike causes and creating a live browser dashboard. OpenClaw completed the work using 203k tokens in 12 minutes 1 second via a bash script whereas Hermes used 257k tokens.

Original post
Reposted by

I’m sorry that you’re this desperate that you will take such an unscientific benchmark instead of any established one.

Also qwen local is one of the most random length models there is with all its looping. And we smoke you all on quality benchmarks on every open model. Here’s wildclawbench by internlm, same speed on open models, much better results.

Peter Steinberger 🦞Peter Steinberger 🦞@steipete

Looks like our focus on performance paid off.

8:47 AM · May 16, 2026 · 279.5K Views
5:15 PM · May 16, 2026 · 31.2K Views

Also there’s this wolfbench too by weights and biases. But do any of these benchmarks matter? I’d argue generally no! It’s the user experience and the community experience that matters, and you have failed. Thats part why your token throughput has done nothing but collapse, starting the day Hermes was released, and in just 3 days since surpassing you, we’ve nearly 2.5x’ed your token volume. Real users using it. They chose.

5:26 PM · May 16, 2026 · 16.6K Views

While i have the lucky few of you who read Peters tweets here, heres a free 20$ sub to Nous Portal for Hermes Agent, where you can access 300+ models at discounted rates and all the core hermes tools, and even free models. First 100 new users only. Time to drop the lobster

BAZOZFN1

100 uses. Sign up at https://portal.nousresearch.com/manage-subscription

Teknium 🪽Teknium 🪽@Teknium

Also there’s this wolfbench too by weights and biases. But do any of these benchmarks matter? I’d argue generally no! It’s the user experience and the community experience that matters, and you have failed. Thats part why your token throughput has done nothing but collapse, starting the day Hermes was released, and in just 3 days since surpassing you, we’ve nearly 2.5x’ed your token volume. Real users using it. They chose.

5:26 PM · May 16, 2026 · 16.6K Views
5:34 PM · May 16, 2026 · 5.6K Views
OpenClaw outperforms Hermes Agent in Qwen 35B local model benchmark · Digg