this is pretty genius. in a world of increasingly expensive and abundant ai models, products like this are a dream
AI model usage has shifted in a huge way recently:
- from flat-fee subscriptions to usage-based pricing
- from single-model usage to multi-model prompt routing
why? subscriptions were too heavily subsidised and different models are good for different things
every fortune 500 company has BOTH a claude AND a chatgpt sub. engineers are using claude to write a plan and codex to execute it
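a rough sketch of what "routing" means here, in python. this is a toy under loud assumptions, not how factory's router actually works: the plan/execute split comes from the example above, while the keyword lists, the fallback model and the function names are all made up for illustration.

```python
from typing import Optional

# plan -> claude, execute -> codex mirrors the workflow described above.
# the fallback model is an assumption.
ROUTES = {"plan": "claude", "execute": "codex"}
DEFAULT_MODEL = "claude"

# toy keyword lists standing in for a real intent classifier (assumption)
INTENT_KEYWORDS = {
    "plan": ("plan", "design", "outline"),
    "execute": ("implement", "write the code", "fix"),
}


def classify_intent(prompt: str) -> Optional[str]:
    """Return the first intent whose keyword appears in the prompt, else None."""
    text = prompt.lower()
    for intent, keywords in INTENT_KEYWORDS.items():
        if any(keyword in text for keyword in keywords):
            return intent
    return None


def route(prompt: str) -> str:
    """Pick a model name for the prompt, falling back to DEFAULT_MODEL."""
    return ROUTES.get(classify_intent(prompt), DEFAULT_MODEL)


print(route("outline the migration"))  # claude
print(route("implement step 2"))       # codex
```

a real router would presumably swap the keyword match for a learned classifier and weigh cost against quality per request, but the shape is the same: intent in, model out.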
factory router claims to cut your costs by 25% while maintaining frontier results.
if you’re a company spending $100M per year on tokens (and there will be many) then saving $25M is a no-brainer
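the math, as a quick python sketch. the 25% is factory's claim and the $100M is the example spend above; the function name is hypothetical, and this ignores whatever the router itself costs.

```python
def annual_savings(token_spend: float, savings_rate: float) -> float:
    """Dollars saved per year at a given spend and claimed savings rate."""
    return token_spend * savings_rate


spend = 100_000_000  # example annual token spend from the post
claimed_rate = 0.25  # factory's claimed cost reduction

print(f"${annual_savings(spend, claimed_rate):,.0f}")  # $25,000,000
```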
routing companies like factory then get to build a huge moat around intents.
v smart (and no, i’m not an investor, i just like the tech)