Now I just need to find a way to use this one. Someone help me out.
Guys you’re still trying to replace Fable. I’m telling you we can’t do it. But we can recreate it in the aggregate.
It challenges single frontier model dominance on cost-accuracy trade-offs.
Now I just need to find a way to use this one. Someone help me out.
Guys you’re still trying to replace Fable. I’m telling you we can’t do it. But we can recreate it in the aggregate.
Positive users praise OpenRouter's Fusion API for advancing router-based compound models and task routing economics, while negative users call the launch fraudulent or criticize labs for losing orchestration advantage.
This is an insane release from OpenRouter, and not just because it's perfect timing.
It shows that frontier models alone do not own all the points on the cost-accuracy Pareto curve for knowledge work tasks; in fact they may not be on the Pareto curve at all. The Pareto curve may be defined by a mixture of models, which any independent third-party (e.g. an AI startup) has access to but the model labs do not.
It's also surprising because this feature seems extremely horizontal and is not even well-tuned for a specific task. You can prompt the Fusion API with anything. This just means that for any given workflow subset, there's even greater alpha to exploit, by hillclimbing a task-specific benchmark. The more specific the workflow, the more hillclimbing you can do.
This should be pretty obvious with a practical example - if you're trying to automate invoice reconciliation at scale, you can be orders of magnitude cheaper and more reliable than "raw" Claude by tuning an agentic workflow with a mixture of models for document extraction, line-item validation, and contract matching.
That alpha is what's exploitable by any company out there that's not a frontier lab.
Introducing the Fusion API, the smartest compound model in the market.
Fusion achieves Fable-level intelligence at half the price.
How it works 👇

@jerryjliu0 I am taking this release with a massive grain of salt. Do you really think OpenAI, Anthropic, even Meta / xAI would miss something so obvious? Not to mention the benchmarks not passing the smell test.

@jerryjliu0 Yeah, this. Most of the work is routine: summaries, retrieval, basic rewrites. You don't need top-of-the-line for that.
The economics flip the moment you route by task type.

@jerryjliu0 Labs built the models and gave away the orchestration advantage. Classic.

@jerryjliu0 这不是骗子是什么?

@jerryjliu0 but it also shows deepseek is better than gpt-5.5 🙄🤥

@jerryjliu0 You don't even need a router. Simply have a second LLM review and critique the plan from the first LLM. This approach frequently leads to major improvements, helps avoid problems, and reduces the total number of reasoning cycles.

@jerryjliu0 the alpha lives in the router not the mixture. classifier picks the route, eval grades it, kv cache invalidates across heterogeneous calls. the pareto frontier is bounded by router accuracy. what's the router's own eval?

@jerryjliu0 that Pareto line gets weird when models start dividing labor on the backend. compound agents eating everyone's lunch

@jerryjliu0 Totally agree—Fusion makes the “router + judge + synthesis” layer feel like the new Pareto frontier, not any single frontier model. Curious: do you think the durable moat is (1) task-specific evals + hillclimbing loops, or (2) proprietary workflow data/feedback at scale?