Tech · 11h ago

Nous Research co-founder Teknium launches Mixture of Agents 2.0, enabling Hermes Agent to combine multiple LLMs into virtual models

The mixture beat GPT 5.5 by 11% on HermesBench.


#331

Original post

Teknium 🪽@Teknium · #331 in Tech

Introducing Mixture of Agents 2.0 in Hermes Agent.

Combine any provider's models into a mixture of your own. Access your presets as if they were normal models in Hermes.

Big improvement in our soon-to-release HermesBench against opus and gpt-5.5 with MoA using Opus & GPT together.

Nous Research@NousResearch

The strongest models are gated and access is granted only to a select few.

Hermes Agent now exposes MoA presets as virtual models, giving you capabilities beyond the publicly available frontier: 8% higher than Opus 4.8 and 11% higher than GPT 5.5 on our upcoming benchmark.

2:07 PM · Jun 26, 2026 · 303.8K Views

Sentiment

Many users are excited about Hermes Agent's Mixture of Agents 2.0 because it combines multiple models via custom presets and is reported to outscore single models such as Opus and GPT on the upcoming HermesBench.

Pos: 96.1%

Neg: 3.9%

135 comments with sentiment.


Posts from X

Most Activity

Views: 40.2K · Likes: 641 · Retweets: 20 · Replies: 61

Teknium 🪽@Teknium

@NousResearch We are working on benching various combos of open source models to see if we can get Opus levels with much cheaper models as well, stay tuned!

11h ago · 40.2K views · 641 likes

Bookmarks: 115

Teknium 🪽@Teknium

@NousResearch Docs on how to set up your own custom mixture of models here:

https://hermes-agent.nousresearch.com/docs/user-guide/features/mixture-of-agents

11h ago · 5.5K views

Nous Research@NousResearch

HermesBench full leaderboard coming soon. Stay tuned!

11h ago · 13.2K views

Teknium 🪽@Teknium

Give mixture of agents a try today!

YanXbt@IBuzovskyi

HERMES AGENT MIXTURE OF AGENTS COMBINES MULTIPLE MODELS INTO ONE ANSWER. 8% HIGHER THAN OPUS 4.8. 11% HIGHER THAN GPT-5.5. NO GATED ACCESS REQUIRED.

Mixture of Agents (MoA) runs multiple models on the same query in parallel. an aggregator model synthesizes all responses into one answer that outperforms any single model alone.

Nous Research benchmarks: 8% higher than Opus 4.8. 11% higher than GPT-5.5. (upcoming benchmark, numbers from official announcement.)

HOW IT WORKS:

you select a MoA preset as your model. Hermes fans out your query to 2-3 reference models. each responds independently. the aggregator reads all responses and synthesizes one final answer.

you see one response. behind it: multiple perspectives.
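The fan-out and aggregation flow described above can be sketched roughly as follows. This is a minimal illustration, not Hermes Agent's actual implementation: `make_stub`, `mixture_of_agents`, and the aggregator prompt wording are all hypothetical, and the stub functions stand in for real provider calls.

```python
import asyncio
from typing import Awaitable, Callable, List

# Assumption: a "model" is any async function from prompt text to response text.
Model = Callable[[str], Awaitable[str]]


def make_stub(name: str) -> Model:
    """Build a fake model that echoes the prompt (stand-in for a provider call)."""
    async def call(prompt: str) -> str:
        return f"[{name}] answer to: {prompt}"
    return call


async def mixture_of_agents(prompt: str, references: List[Model], aggregator: Model) -> str:
    # Fan out: every reference model receives the same prompt concurrently.
    drafts = await asyncio.gather(*(ref(prompt) for ref in references))
    # Aggregate: one model reads all drafts and writes the single final answer.
    numbered = "\n".join(f"Response {i + 1}: {d}" for i, d in enumerate(drafts))
    agg_prompt = f"Question: {prompt}\n{numbered}\nSynthesize one final answer."
    return await aggregator(agg_prompt)


def run_demo() -> str:
    refs = [make_stub("ref-a"), make_stub("ref-b")]
    return asyncio.run(mixture_of_agents("What is MoA?", refs, make_stub("aggregator")))


if __name__ == "__main__":
    print(run_demo())
```

The caller only ever sees the aggregator's return value, which matches the "one response, multiple perspectives" description.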

DEFAULT PRESET:

moa:
  default_preset: default
  presets:
    default:
      reference_models:
        - provider: openai-codex
          model: gpt-5.5
        - provider: openrouter
          model: deepseek/deepseek-v4-pro
      aggregator:
        provider: openrouter
        model: anthropic/claude-opus-4.8
      reference_temperature: 0.6
      aggregator_temperature: 0.4
      max_tokens: 4096
      enabled: true

two reference models generate diverse responses. Opus aggregates them into one answer. the output is better than any of the three alone.

SETUP:

Desktop app / Dashboard → Models → MoA presets
CLI: hermes moa configure

create named presets for different tasks:

hermes moa configure            # default preset
hermes moa configure review     # create "review" preset
hermes moa configure research   # create "research" preset
hermes moa list                 # see all presets
hermes moa delete review        # remove a preset

BUILD YOUR OWN PRESETS:

CHEAP RESEARCH (2 models, budget):

presets:
  research_lite:
    reference_models:
      - provider: openrouter
        model: deepseek/deepseek-v4
      - provider: openrouter
        model: google/gemini-2.5-flash
    aggregator:
      provider: openrouter
      model: anthropic/claude-sonnet-4.6

diverse perspectives at budget prices. Sonnet aggregates. good enough for daily research.

MAXIMUM QUALITY (3 models, premium):

presets:
  full_power:
    reference_models:
      - provider: openai-codex
        model: gpt-5.5
      - provider: openrouter
        model: deepseek/deepseek-v4-pro
      - provider: openrouter
        model: google/gemini-2.5-pro
    aggregator:
      provider: openrouter
      model: anthropic/claude-opus-4.8

three frontier models + Opus aggregation. this is the preset that beats benchmarks.

MID-SESSION TOGGLE:

/moa            # toggle MoA on/off
/moa research   # switch to named preset
/moa off        # disable, use plain model

working on routine code? MoA off. hit a hard architectural problem? /moa on. one command. no config edit. no restart.
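The toggle semantics above can be modeled as a small state transition. This is only a sketch of how such commands might be interpreted: `MoaState` and `handle_moa_command` are hypothetical names, and it assumes that switching to a named preset also turns MoA on, which the post does not state.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MoaState:
    enabled: bool = False
    preset: str = "default"


def handle_moa_command(state: MoaState, line: str) -> MoaState:
    """Return the new state after one chat line; non-/moa lines change nothing."""
    parts = line.split()
    if not parts or parts[0] != "/moa":
        return state
    arg = parts[1] if len(parts) > 1 else None
    if arg is None:
        # Bare "/moa" flips the current on/off state.
        return MoaState(not state.enabled, state.preset)
    if arg == "off":
        return MoaState(False, state.preset)
    if arg == "on":
        return MoaState(True, state.preset)
    # Assumption: any other argument is a preset name, and selecting it enables MoA.
    return MoaState(True, arg)


if __name__ == "__main__":
    s = MoaState()
    for cmd in ["/moa", "/moa research", "/moa off"]:
        s = handle_moa_command(s, cmd)
        print(cmd, "->", s)
```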

SAFETY RAILS:

→ aggregator cannot be another MoA preset (recursive MoA trees blocked)
→ enabled: false disables fan-out (aggregator acts as a plain model)
→ each reference model runs in parallel (wall clock ≈ slowest model, not sum)
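The "wall clock ≈ slowest model, not sum" point can be checked with simulated latencies. This sketch uses `asyncio.sleep` as a stand-in for network calls; `fake_model`, `fan_out`, and `measure` are illustrative names, and real provider latency will vary.

```python
import asyncio
import time
from typing import List, Tuple


async def fake_model(latency_s: float) -> str:
    # Stand-in for a reference model call; the latency is simulated.
    await asyncio.sleep(latency_s)
    return f"done after {latency_s}s"


async def fan_out(latencies: List[float]) -> Tuple[List[str], float]:
    start = time.perf_counter()
    # gather() runs all calls concurrently, so waiting time overlaps.
    results = await asyncio.gather(*(fake_model(s) for s in latencies))
    return list(results), time.perf_counter() - start


def measure(latencies: List[float]) -> Tuple[List[str], float]:
    return asyncio.run(fan_out(latencies))


if __name__ == "__main__":
    latencies = [0.05, 0.10, 0.20]
    _, elapsed = measure(latencies)
    print(f"slowest={max(latencies):.2f}s sum={sum(latencies):.2f}s elapsed={elapsed:.2f}s")
```

Note that this covers only the reference stage; the aggregator call runs afterwards and adds its own latency on top.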

WHERE MoA MAKES SENSE:

→ complex architecture decisions
→ research synthesis across diverse sources
→ code review where one model misses edge cases
→ critical content that needs multi-perspective verification
→ any task where "second opinion" matters

WHERE MoA IS OVERKILL:

→ simple file edits
→ routine web searches
→ cron jobs and monitoring
→ tasks where speed matters more than depth

MoA multiplies your token cost by roughly the number of reference models, plus the aggregator pass that reads every response. use it for the 10% of tasks where quality matters most.
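A back-of-the-envelope estimate makes the cost trade-off concrete. This is a simplified model under stated assumptions: every model returns a response of the same length, the aggregator's input is the original prompt plus all reference responses, and prices per token are ignored. The function names are illustrative.

```python
def single_model_tokens(prompt_tokens: int, response_tokens: int) -> int:
    """Tokens processed by one plain model call (input plus output)."""
    return prompt_tokens + response_tokens


def moa_token_estimate(prompt_tokens: int, response_tokens: int, n_refs: int) -> int:
    """Rough total tokens for one MoA call under the assumptions above."""
    # Each reference model reads the prompt and writes one response.
    reference_total = n_refs * (prompt_tokens + response_tokens)
    # Assumption: the aggregator reads the prompt plus every reference response.
    aggregator_input = prompt_tokens + n_refs * response_tokens
    aggregator_total = aggregator_input + response_tokens
    return reference_total + aggregator_total


if __name__ == "__main__":
    prompt, response = 1000, 500
    for n in (2, 3):
        total = moa_token_estimate(prompt, response, n)
        ratio = total / single_model_tokens(prompt, response)
        print(f"{n} reference models: {total} tokens, {ratio:.2f}x a single call")
```

Under these assumptions the multiplier comes out somewhat higher than the reference-model count alone, because the aggregator pass is an extra call with a longer input.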

full Hermes architecture deep-dive in the article 👇

2h ago · 7.4K views

Teknium 🪽@Teknium

We are working on benching various combos of open source models to see if we can get Opus levels with much cheaper models as well, stay tuned!


11h ago · 4.5K views

Teknium 🪽@Teknium

We have Fable at home


11h ago · 3.4K views

Nous Research@NousResearch

@SemperApertus Very soon!

11h ago · 4.8K views

Ex Machina@SemperApertus

@NousResearch Now give me easy cloud access with a nice interface and take my goddamn money!!

11h ago · 5.8K views

Nous Research@NousResearch

@notismaelvega @OpenRouter Provider agnostic

11h ago · 5.2K views

Nous Research@NousResearch

@0xvsr We aim to empower

11h ago · 4.2K views

Chris J Terry@chrisjterry

@NousResearch Yesterday's project - 120M tokens for $14. By swapping these models you can get anything done, I mean ANYTHING.

11h ago

emozilla@theemozilla

@NousResearch feel bad for people stuck using gpt-5.5 and opus-4.8 when we've got hermes over here

11h ago · 1.6K views

Nous Research@NousResearch

@filibluster Can't do much with a "temporarily" gated model can you

11h ago · 3.5K views

Teknium 🪽@Teknium

@SurrealBlend @NousResearch Lots of combinations possible. Also you aren't limited to 2 models. You could have 5 opuses, 3 gpt's, one grok, and a gemini for good measure

11h ago · 1.8K views