@OrcaRouter: Fable 5 is dead. We just resurrected it — cheaper, open and you hold the keys. OpenRouter dropped Fusion 48h ago and br…
Summary
OrcaRouter is a new AI gateway that intelligently routes prompts to the best model, offering cost savings, guardrails, and full observability with zero token markup and a free tier.
View Cached Full Text
Cached at: 06/15/26, 07:09 PM
Fable 5 is dead. We just resurrected it — cheaper, open and you hold the keys.
OpenRouter dropped Fusion 48h ago and broke the internet.
We tested it hard. The synthesizer is insane for deep research… but absolute dogshit for coding. So we fixed it.
Meet http://OrcaRouter.ai DSL — the version you actually own. One prompt → fans out to any panel you want → judge + synthesizer → one god-tier answer.
But unlike black-box slugs, you control the entire graph in YAML.
Fable 5 level intelligence… without waiting for Anthropic to turn it back on
OrcaRouter — One AI gateway: adaptive LLM routing & governance
Source: https://www.orcarouter.ai/
OrcaRouterOne gateway · every model · all your AI traffic
One Gateway. Every Model.Route Smarter. Ship Safer. Spend Less.
OrcaRouter grades every prompt and routes it intelligently. Frontier-quality AI at up to 40% lower cost. Adaptive routing, load balancing, guardrails, agent firewall, observability, and governance — all through a single OpenAI-compatible endpoint.
No credit card · live in 60 secs
- client = OpenAI(api_key="sk-...")+ client = OpenAI(+ base_url="https://api.orcarouter.ai/v1",+ api_key="sk-orca-..."+ )# Everything else stays the same.response = client.chat.completions.create( model="orcarouter/auto", # router picks the best model per request messages=[{"role": "user", "content": "..."}])# → orcarouter/auto grades the prompt → frontier or open-source, zero token markup ✓
One line. We grade each prompt, route to frontier or OSS, and add $0.
grok/grok-4.31\.25in·2.50out—
Anthropic: Claude Opus 4.85\.00in·25.00outAnthropic Direct
Anthropic: Claude Opus 4.75\.00in·25.00outAnthropic Direct
Google: Gemini 3.1 Pro Preview2\.00in·12.00outGoogle Direct
OpenAI: GPT-5.5 Pro30\.00in·180.00outOpenAI Direct
grok/grok-4.31\.25in·2.50out—
Anthropic: Claude Opus 4.85\.00in·25.00outAnthropic Direct
Anthropic: Claude Opus 4.75\.00in·25.00outAnthropic Direct
Google: Gemini 3.1 Pro Preview2\.00in·12.00outGoogle Direct
OpenAI: GPT-5.5 Pro30\.00in·180.00outOpenAI Direct
Integrations
Works with the tools you already use
Drop-in OpenAI-compatible, or connect agents over the OrcaRouter MCP server — keep your SDK, framework and editor.
OrcaRouter MCP serverOpenAI SDKGoogle GenAI SDKAnthropic SDKLangChainLlamaIndexVercel AI SDKCamelAIDifyCursorOpenCodePromptfooOpenClawOpenHumanGitHubcURLand more
AI gateway for production
Smart routing and automatic failover on every request.
Routing that’s measurably more accurate.
Every prompt is embedded and routed by a model that keeps learning online from real traffic. On the public RouterArena leaderboard (Jun 2026) it leads on accuracy — ahead of GPT-5, Azure, Martian and NotDiamond — at 75.5%.
contextual embeddingsonline learning<1ms overheadRouterArena
* Based on RouterArena leaderboard data, June 2026.
A provider goes down. No one notices.
When a provider rate-limits or 5xxs, OrcaRouter retries the request against a healthy model across 200+ options before the response starts — so transient upstream outages don’t surface to your users.
200+ modelsauto-failoverno 429
See and prove every call — cost, model, latency, and why.
See everything. Prove anything.
See exactly what every request cost, which model served it, how long it took, and why it failed — full structured logs you can filter, replay, and copy as a runnable cURL. A route is never a black box.
Per-request logsgrade · model · costcopy-as-cURL
Zero markup. Zero black boxes.
You pay each provider their exact price — we add $0 per token, ever. Every request shows the grade, chosen model, provider, latency, and price, so cost is glass-box, not an opaque blended rate.
$0 / tokenprovider costglass-box receipt
Versioned prompts and caching — without a redeploy.
Change prompts. Not code.
Version prompts behind named labels with A/B splits and one-click rollback. Move a label and every request picks it up instantly — no redeploy, no code change, no client update.
VersionedA/BInstant rollbackNo deploy
Pay once. Reuse for free.
Repeated and cached prompt tokens bill at the provider’s cache rate — often a fraction of the input price — across 5-minute and 1-hour ephemeral windows. Same answers, less spend, with cached_tokens on every receipt.
cache_controlcached_tokens5m / 1h windows
Guardrails, budgets, and an agent firewall that enforces.
Guardrails that stop things.
PII Shield and content policies run before the upstream call is billed. A blocked request returns a clean 400 and is never charged — guardrails enforced inline, not logged after the fact.
PII Shieldenforced pre-billingclean 400
Safe for your team. And your agents.
Budgets and roles for people; a risk-scored firewall for agents. Every tool and MCP call is graded ALLOW, REVIEW, or BLOCK before it runs, and anomaly detection flags rate and cost spikes against learned hour-of-week baselines.
ALLOW · REVIEW · BLOCKMCP gatinganomaly detection
Built for the agent era. Before you needed it.
Setup
Live in 60 seconds.
One URL change. Your existing SDK, model names, and streaming all work exactly as before.
Step 1
🔗
Point your SDK at us
Setbase\_urltoapi\.orcarouter\.ai/v1and swap your API key. No other code changes needed.
→
Step 2
⚡
We route, guard & observe
Every call is routed to the best model, checked against your guardrails, and metered — graded in under 1ms, with failover, caching and full logs built in.
→
Step 3
✓
You ship, on one endpoint
Traffic goes direct to each provider’s first-party API at their published rate — we add $0 per token. One OpenAI-compatible endpoint for routing, observability and governance.
Every model. One price list.
200+ models with live, side-by-side pricing — what you’d pay the provider directly. We add $0 on top.
ModelRouted toInput /MOutput /MContextQualitykimi/kimi-k2.7-codeNEWMoonshot0\.9504.00262K8.0qwen/qwen3.7-plusNEWAlibaba Cloud0\.3501.421M8.0minimax/minimax-m3NEW—0\.3001.201M9.0anthropic/claude-opus-4.8NEWAnthropic Direct5\.0025.001M10.0google/gemini-3.5-flashNEWGoogle Direct1\.509.001M9.0qwen/qwen3.7-maxNEWAlibaba Cloud1\.253.751M5.0qwen/qwen3.7-max-2026-05-20NEWAlibaba Cloud1\.253.751M5.0qwen/qwen3.6-flashAlibaba Cloud0\.2501.501M7.0qwen/qwen3.6-35b-a3bAlibaba Cloud0\.2481.48262K8.0openai/gpt-5.5-proOpenAI Direct30\.00180.00—10.0openai/gpt-5.5OpenAI Direct5\.0030.00—10.0deepseek/deepseek-v4-proDeepSeek0\.4560.9101M9.0deepseek/deepseek-v4-flashDeepSeek0\.1470.2941M8.0anthropic/claude-opus-4.7Anthropic Direct5\.0025.001M10.0z-ai/glm-5.1Zhipu AI1\.404.40200K9.0+ 194 more models · Prices update every 60 seconds
Everything your OpenAI client already calls.
Streaming, tool calls, structured outputs, vision, embeddings and audio — routed unchanged across every model.
ModelStreamingToolsStructuredVisionEmbeddingsAudiogrok/grok-4.3supportedsupportedsupportedsupportednot supportednot supportedanthropic/claude-opus-4.8supportedsupportedsupportedsupportednot supportednot supportedanthropic/claude-opus-4.7supportedsupportedsupportedsupportednot supportednot supportedgoogle/gemini-3.1-pro-previewsupportedsupportedsupportedsupportednot supportedsupportedopenai/gpt-5.5-prosupportedsupportedsupportedsupportednot supportednot supported
Pricing
Routing is free. Pay for features.
We never take a cut of your token spend. Our revenue comes from optional team features.
Zero markup guarantee
You pay providers directly at their published rates. We add nothing on top of token costs. Routing is free; the optional Team plan funds the platform.
$0.00routing fee
Hacker
Free
Forever. Zero markup on all tokens.
✓ Route — 200+ models, auto-failover
✓ Observe — basic dashboard
✓ Manage — prompt versioning
✓ 3 API keys · 0% token markup
Team
$499/mo
Still zero markup. Pay for features.
✓ Everything in Hacker
✓ Up to 10 team seats
✓ Compliance enforcement & reports
✓ Unlimited API keys
✓ Priority support
Enterprise
Custom
SLA commitments + private deployment.
✓ Everything in Team
✓ Private / on-prem deployment
✓ 99.99% uptime SLA
✓ Dedicated infrastructure
✓ Dedicated support & custom pricing
Trust & Compliance
Independently audited. Continuously compliant.
Audit reports available under NDA — request a copy below.
Smarter, safer, cost-efficient.
Swap one line. That’s the migration.
Sign up with GitHub — $5 in tokens free. No credit card required. You’re live in under a minute.
OrcaRouter
© 2026 OrcaRouter
Similar Articles
Anyways while some of you are dooming about Fable Open Router announced Fusion
Open Router announced Fusion, a new system that may be similar to OpenAI's pro model, generating excitement in the community.
Fable 5 Is Dead. And Honestly? We Might Be Better Off
US government forced Anthropic to pull its most powerful model, Fable 5, just days after launch. New benchmarks from OpenRouter show that fused panels of budget models can match or exceed Fable 5's performance at half the cost, raising questions about the value of frontier models.
Openrouter Fusion API
OpenRouter's Fusion API offers pricing and provider information for routing AI model requests across multiple providers, enabling flexible and cost-effective access to various AI models.
@iamtrask: This is a *way* bigger deal than it seems... Frontier AI companies will *never* own the frontier again I kid you not...…
OpenRouter launches Fusion API, a compound model that combines multiple AI models to achieve fable-level intelligence at half the price, potentially shifting the frontier of AI performance.
@alexatallah: If you're a researcher looking to: → conduct rigorous studies on how multiple models can outperform the frontier → leve…
OpenRouter launches Fusion API, a compound model that achieves high intelligence at half the price, leveraging the largest LLM marketplace.