Combine multiple LLMs into one OpenAI-compatible endpoint. You call one model — the router routes each request to the best LLM.
You select one model name for your product. Connect providers, write routing rules in plain English, and serve a single OpenAI-compatible endpoint — clients always pass that one model.
Add OpenAI, Anthropic, OpenRouter, or any compatible API. These are the models your compound model selects from.
Describe when to use which model. Strong models for hard tasks, cheap ones for simple work, vision models for images. No DSL — just write what you mean.
Clients call the model name you chose. Behind it, the router applies your rules each request, pins threads when you want consistency, falls back on failure, and records every decision.
You define how models are chosen in plain English and which providers are in play. Fallbacks, thread pinning, circuit breakers, and routing traces are part of the product so automation stays transparent and under your control.
Connect OpenAI, Anthropic, OpenRouter, or any compatible API. Your compound model selects from exactly the providers you choose.
Describe routing logic in natural language — no DSL, no YAML. Tell the router which models to use and when, and it executes.
Start with presets like Fast Coding, Speed-First, or Customer Support — then customize the model mix, instructions, and fallbacks to your product.
Three modes — re-evaluate every turn, pin for 1–6 turns then re-route, or pin on first message. Use $$route to force re-evaluation mid-thread.
If a provider fails, the router falls back to the next model in your chain. The circuit breaker tracks error rate, fallback rate, and latency spikes — unhealthy models are auto-disabled for 30 minutes.
Every response includes x-router-model-selected, x-router-request-id, and x-router-degraded headers. See which model handled each request and why.
Every hosted tier includes the full router: auto model routing, thread pinning, circuit breakers, and routing explanations. You can also self-host the same open-source router on your own Cloudflare account.
Evaluate the hosted router with no commitment.
$60 billed yearly
Limited-time promotional pricing.
For individual builders shipping real products with multi-model routing.
$7.99 billed monthly
$60 billed yearly
Promo redemptions start Starter immediately and downgrade to Free when the promo ends unless you upgrade separately.
Limited-time promotional pricing.
Save about 37% with yearly billing.
If you need to sign in first, we'll take you straight to Billing with your promo code ready.
Custom volume, self-hosted deployment help, and SLA.
All plans use the same router engine. The only difference is daily request volume and support level.
Want full control of your infrastructure? Self-host on your own Cloudflare account →
Prefer to inspect the code first? Browse the open-source repo →