This is an insane release from OpenRouter, and not just because it's perfect timing.
It shows that frontier models alone do not own all the points on the cost-accuracy Pareto curve for knowledge work tasks; in fact they may not be on the Pareto curve at all. The Pareto curve may be defined by a mixture of models, which any independent third-party (e.g. an AI startup) has access to but the model labs do not.
It's also surprising because this feature seems extremely horizontal and is not even well-tuned for a specific task. You can prompt the Fusion API with anything. This just means that for any given workflow subset, there's even greater alpha to exploit, by hillclimbing a task-specific benchmark. The more specific the workflow, the more hillclimbing you can do.
This should be pretty obvious with a practical example - if you're trying to automate invoice reconciliation at scale, you can be orders of magnitude cheaper and more reliable than "raw" Claude by tuning an agentic workflow with a mixture of models for document extraction, line-item validation, and contract matching.
That alpha is what's exploitable by any company out there that's not a frontier lab.
Introducing the Fusion API, the smartest compound model in the market.
Fusion achieves Fable-level intelligence at half the price.
How it works 👇


















