@GitHub_Daily: 最近一个名为 9router 的开源工具火了,给所有 AI 编程工具加了一个智能调度中心。 像平时用 Claude Code 写代码,API 额度消耗极快,遇到大段报错日志更是瞬间把 Token 烧光。 而 9router 则内置了智能压…

X AI KOLs Timeline 工具

摘要

9router 是一个开源的 AI 编程工具智能路由中心,内置压缩算法节省 Token,支持三级降级自动切换模型,原生兼容 Claude Code、Cursor 等主流工具,可路由到数十家模型服务商,有效降低 API 调用成本。

最近一个名为 9router 的开源工具火了,给所有 AI 编程工具加了一个智能调度中心。 像平时用 Claude Code 写代码,API 额度消耗极快,遇到大段报错日志更是瞬间把 Token 烧光。 而 9router 则内置了智能压缩算法,自动精简终端日志或代码差异中的冗余信息,每次请求能帮我们省下不少 Token 消耗。 核心亮点是实用的三级降级机制。 遇到主力模型额度耗尽或报错,会自动无缝切换到备用的便宜或免费模型,确保开发过程不被打断。 原生兼容 Claude Code、Codex、Cursor各类主流 AI 编程工具,轻松将请求路由到几十家不同的模型服务商。 GitHub:http://github.com/decolua/9router 并且提供了开箱即用的部署方案,通过 npm 全局安装或 Docker 就能快速跑起来。 自带直观的本地数据面板,方便我们随时监控额度消耗并灵活配置模型组合。 非常适合重度依赖 AI 辅助编程的朋友,能有效降低接口调用成本,值得部署体验。
查看原文 导出为 Word 导出为 PDF
查看缓存全文

缓存时间: 2026/05/11 12:41

最近一个名为 9router 的开源工具火了,给所有 AI 编程工具加了一个智能调度中心。 像平时用 Claude Code 写代码,API 额度消耗极快,遇到大段报错日志更是瞬间把 Token 烧光。 而 9router 则内置了智能压缩算法,自动精简终端日志或代码差异中的冗余信息,每次请求能帮我们省下不少 Token 消耗。 核心亮点是实用的三级降级机制。 遇到主力模型额度耗尽或报错,会自动无缝切换到备用的便宜或免费模型,确保开发过程不被打断。 原生兼容 Claude Code、Codex、Cursor各类主流 AI 编程工具,轻松将请求路由到几十家不同的模型服务商。 GitHub:http://github.com/decolua/9router 并且提供了开箱即用的部署方案,通过 npm 全局安装或 Docker 就能快速跑起来。 自带直观的本地数据面板,方便我们随时监控额度消耗并灵活配置模型组合。 非常适合重度依赖 AI 辅助编程的朋友,能有效降低接口调用成本,值得部署体验。


decolua/9router

Source: https://github.com/decolua/9router

9Router Dashboard

9Router - FREE AI Router & Token Saver

Never stop coding. Save 20-40% tokens with RTK + auto-fallback to FREE & cheap AI models.

Connect All AI Code Tools (Claude Code, Cursor, Antigravity, Copilot, Codex, Gemini, OpenCode, Cline, OpenClaw…) to 40+ AI Providers & 100+ Models.

npm Downloads License

decolua%2F9router | Trendshift

🚀 Quick Start💡 Features📖 Setup🌐 Website

🇻🇳 Tiếng Việt🇨🇳 中文🇯🇵 日本語


🤔 Why 9Router?

Stop wasting money, tokens and hitting limits:

  • ❌ Subscription quota expires unused every month
  • ❌ Rate limits stop you mid-coding
  • ❌ Tool outputs (git diff, grep, ls…) burn tokens fast
  • ❌ Expensive APIs ($20-50/month per provider)
  • ❌ Manual switching between providers

9Router solves this:

  • RTK Token Saver - Auto-compress tool_result content, save 20-40% tokens per request
  • Maximize subscriptions - Track quota, use every bit before reset
  • Auto fallback - Subscription → Cheap → Free, zero downtime
  • Multi-account - Round-robin between accounts per provider
  • Universal - Works with Claude Code, Codex, Cursor, Cline, any CLI tool

🔄 How It Works

┌─────────────┐
│  Your CLI   │  (Claude Code, Codex, OpenClaw, Cursor, Cline...)
│   Tool      │
└──────┬──────┘
       │ http://localhost:20128/v1
       ↓
┌─────────────────────────────────────────────┐
│           9Router (Smart Router)            │
│  • RTK Token Saver (cut tool_result tokens) │
│  • Format translation (OpenAI ↔ Claude)     │
│  • Quota tracking                           │
│  • Auto token refresh                       │
└──────┬──────────────────────────────────────┘
       │
       ├─→ [Tier 1: SUBSCRIPTION] Claude Code, Codex, GitHub Copilot
       │   ↓ quota exhausted
       ├─→ [Tier 2: CHEAP] GLM ($0.6/1M), MiniMax ($0.2/1M)
       │   ↓ budget limit
       └─→ [Tier 3: FREE] Kiro, OpenCode Free, Vertex ($300 credits)

Result: Never stop coding, minimal cost + 20-40% token savings via RTK

⚡ Quick Start

1. Install globally:

npm install -g 9router
9router

🎉 Dashboard opens at http://localhost:20128

2. Connect a FREE provider (no signup needed):

Dashboard → Providers → Connect Kiro AI (free Claude unlimited) or OpenCode Free (no auth) → Done!

3. Use in your CLI tool:

Claude Code/Codex/OpenClaw/Cursor/Cline Settings:
  Endpoint: http://localhost:20128/v1
  API Key: [copy from dashboard]
  Model: kr/claude-sonnet-4.5

That’s it! Start coding with FREE AI models.

Alternative: run from source (this repository):

This repository package is private (9router-app), so source/Docker execution is the expected local development path.

cp .env.example .env
npm install
PORT=20128 NEXT_PUBLIC_BASE_URL=http://localhost:20128 npm run dev

Production mode:

npm run build
PORT=20128 HOSTNAME=0.0.0.0 NEXT_PUBLIC_BASE_URL=http://localhost:20128 npm run start

Default URLs:

  • Dashboard: http://localhost:20128/dashboard
  • OpenAI-compatible API: http://localhost:20128/v1

Video Guides

9Router Setup Tutorial
🇺🇸 English
9Router + Claude Code FREE Setup
by Build AI With Hamid
Tiết kiệm chi phí LLM với 9Router
🇻🇳 Tiếng Việt
Tiết kiệm chi phí LLM cho OpenClaw với 9Router
by Mì AI
Claude Code FREE Forever
🇺🇸 English
Claude Code FREE Forever — Unlimited Models
by Build AI With Hamid
Claude CLI Free Setup
🇺🇸 English
Claude CLI Free Setup with 9Router 🚀
by CodeVerse Soban
Cài đặt OpenClaw Free A-Z
🇻🇳 Tiếng Việt
Cài Đặt OpenClaw Free Từ A-Z + 9Router
by Mai Gia
FREE OpenClaw with Claude Opus
🇺🇸 English
FREE OpenClaw + Claude Opus 4.6
by Build AI With Hamid

🎬 Made a video about 9Router? Submit a Pull Request adding your video to this section — we’ll merge it!


🛠️ Supported CLI Tools

9Router works seamlessly with all major AI coding tools:

Claude Code
Claude-Code
OpenClaw
OpenClaw
Codex
Codex
OpenCode
OpenCode
Cursor
Cursor
Antigravity
Antigravity
Cline
Cline
Continue
Continue
Droid
Droid
Roo
Roo
Copilot
Copilot
Kilo Code
Kilo Code

🌐 Supported Providers

🔐 OAuth Providers

Claude Code
Claude-Code
Antigravity
Antigravity
Codex
Codex
GitHub
GitHub
Cursor
Cursor

🆓 Free Providers

Kiro
Kiro AI
Claude 4.5 + GLM-5 + MiniMax
Unlimited FREE
OpenCode Free
OpenCode Free
No auth • Auto-fetch models
Unlimited FREE
Vertex AI
Vertex AI
Gemini 3 Pro + GLM-5 + DeepSeek
$300 credits free

Note: iFlow, Qwen and Gemini CLI free tiers were discontinued in 2026. Use Kiro / OpenCode Free / Vertex instead.

🔑 API Key Providers (40+)

OpenRouter
OpenRouter
GLM
GLM
Kimi
Kimi
MiniMax
MiniMax
OpenAI
OpenAI
Anthropic
Anthropic
Gemini
Gemini
DeepSeek
DeepSeek
Groq
Groq
xAI
xAI
Mistral
Mistral
Perplexity
Perplexity
Together
Together AI
Fireworks
Fireworks
Cerebras
Cerebras
Cohere
Cohere
NVIDIA
NVIDIA
SiliconFlow
SiliconFlow

...and 20+ more providers including Nebius, Chutes, Hyperbolic, and custom OpenAI/Anthropic compatible endpoints


💡 Key Features

FeatureWhat It DoesWhy It Matters
🚀 RTK Token Saver (RTK ⭐40K)Compress tool outputs (git diff, grep, ls, tree…) before sending to LLMSave 20-40% input tokens per request
🪨 Caveman Mode (Caveman ⭐52K)Inject caveman-speak prompt → LLM replies terse, technical substance preservedSave up to 65% output tokens
🎯 Smart 3-Tier FallbackAuto-route: Subscription → Cheap → FreeNever stop coding, zero downtime
📊 Real-Time Quota TrackingLive token count + reset countdownMaximize subscription value
🔄 Format TranslationOpenAI ↔ Claude ↔ Gemini ↔ Cursor ↔ Kiro ↔ VertexWorks with any CLI tool
👥 Multi-Account SupportMultiple accounts per providerLoad balancing + redundancy
🔄 Auto Token RefreshOAuth tokens refresh automaticallyNo manual re-login needed
🎨 Custom CombosCreate unlimited model combinationsTailor fallback to your needs
📝 Request LoggingDebug mode with full request/response logsTroubleshoot issues easily
💾 Cloud SyncSync config across devicesSame setup everywhere
📊 Usage AnalyticsTrack tokens, cost, trends over timeOptimize spending
🌐 Deploy AnywhereLocalhost, VPS, Docker, Cloudflare WorkersFlexible deployment options
📖 Feature Details

🚀 RTK Token Saver

Tool outputs (git diff, grep, find, ls, tree, log dumps…) often eat 30-50% of your prompt budget. RTK detects them and applies smart, lossless compression before the request hits the LLM:

  • Filters: git-diff, git-status, grep, find, ls, tree, dedup-log, smart-truncate, read-numbered, search-list
  • Auto-detect: No config needed — RTK peeks the first 1KB of each tool_result and picks the right filter.
  • Safe by design: If a filter fails, throws, or makes output bigger, RTK silently keeps the original text. Errors never break your request.
  • Universal: Works across all formats (OpenAI, Claude, Gemini, Cursor, Kiro, OpenAI Responses) because it runs before any format translation.
  • Default ON: Toggle anytime in Dashboard → Endpoint settings.
Without RTK: 47K tokens sent to LLM
With RTK:    28K tokens sent to LLM   (40% saved · same context · same answer)

🎯 Smart 3-Tier Fallback

Create combos with automatic fallback:

Combo: "my-coding-stack"
  1. cc/claude-opus-4-6        (your subscription)
  2. glm/glm-4.7               (cheap backup, $0.6/1M)
  3. if/kimi-k2-thinking       (free fallback)

→ Auto switches when quota runs out or errors occur

📊 Real-Time Quota Tracking

  • Token consumption per provider
  • Reset countdown (5-hour, daily, weekly)
  • Cost estimation for paid tiers
  • Monthly spending reports

🔄 Format Translation

Seamless translation between formats:

  • OpenAIClaudeGeminiCursorKiroVertexAntigravityOllamaOpenAI Responses
  • Your CLI tool sends OpenAI format → 9Router translates → Provider receives native format
  • Works with any tool that supports custom OpenAI endpoints

👥 Multi-Account Support

  • Add multiple accounts per provider
  • Auto round-robin or priority-based routing
  • Fallback to next account when one hits quota

🔄 Auto Token Refresh

  • OAuth tokens automatically refresh before expiration
  • No manual re-authentication needed
  • Seamless experience across all providers

🎨 Custom Combos

  • Create unlimited model combinations
  • Mix subscription, cheap, and free tiers
  • Name your combos for easy access
  • Share combos across devices with Cloud Sync

📝 Request Logging

  • Enable debug mode for full request/response logs
  • Track API calls, headers, and payloads
  • Troubleshoot integration issues
  • Export logs for analysis

💾 Cloud Sync

  • Sync providers, combos, and settings across devices
  • Automatic background sync
  • Secure encrypted storage
  • Access your setup from anywhere

Cloud Runtime Notes

  • Prefer server-side cloud variables in production:
    • BASE_URL (internal callback URL used by sync scheduler)
    • CLOUD_URL (cloud sync endpoint base)
  • NEXT_PUBLIC_BASE_URL and NEXT_PUBLIC_CLOUD_URL are still supported for compatibility/UI, but server runtime now prioritizes BASE_URL/CLOUD_URL.
  • Cloud sync requests now use timeout + fail-fast behavior to avoid UI hanging when cloud DNS/network is unavailable.

📊 Usage Analytics

  • Track token usage per provider and model
  • Cost estimation and spending trends
  • Monthly reports and insights
  • Optimize your AI spending

💡 IMPORTANT - Understanding Dashboard Costs:

The “cost” displayed in Usage Analytics is for tracking and comparison purposes only. 9Router itself never charges you anything. You only pay providers directly (if using paid services).

Example: If your dashboard shows “290 total cost" while using iFlow models, this represents what you would have paid using paid APIs directly. Your actual cost = **0** (iFlow is free unlimited).

Think of it as a “savings tracker” showing how much you’re saving by using free models or routing through 9Router!

🌐 Deploy Anywhere

  • 💻 Localhost - Default, works offline
  • ☁️ VPS/Cloud - Share across devices
  • 🐳 Docker - One-command deployment
  • 🚀 Cloudflare Workers - Global edge network

💰 Pricing at a Glance

TierProviderCostQuota ResetBest For
🚀 TOKEN SAVERRTK (built-in)FREEAlways onSave 20-40% tokens on EVERY request
💳 SUBSCRIPTIONClaude Code (Pro/Max)$20-200/mo5h + weeklyAlready subscribed
Codex (Plus/Pro)$20-200/mo5h + weeklyOpenAI users
GitHub Copilot$10-19/moMonthlyGitHub users
Cursor IDE$20/moMonthlyCursor users
💰 CHEAPGLM-5.1 / GLM-4.7$0.6/1MDaily 10AMBudget backup
MiniMax M2.7$0.2/1M5-hour rollingCheapest option
Kimi K2.5$9/mo flat10M tokens/moPredictable cost
🆓 FREEKiro AI$0UnlimitedClaude 4.5 + GLM-5 + MiniMax free
OpenCode Free$0UnlimitedNo auth, auto-fetch models
Vertex AI$300 creditsNew GCP accountsGemini 3 Pro + DeepSeek + GLM-5

💡 Pro Tip: RTK + Kiro AI + OpenCode Free combo = $0 cost + 20-40% token savings!


📊 Understanding 9Router Costs & Billing

9Router Billing Reality:

9Router software = FREE forever (open source, never charges)
Dashboard “costs” = Display/tracking only (not actual bills)
You pay providers directly (subscriptions or API fees)
FREE providers stay FREE (iFlow, Kiro, Qwen = $0 unlimited)
9Router never sends invoices or charges your card

How Cost Display Works:

The dashboard shows estimated costs as if you were using paid APIs directly. This is not billing - it’s a comparison tool to show your savings.

Example Scenario:

Dashboard Display:
• Total Requests: 1,662
• Total Tokens: 47M
• Display Cost: $290

Reality Check:
• Provider: iFlow (FREE unlimited)
• Actual Payment: $0.00
• What $290 Means: Amount you SAVED by using free models!

Payment Rules:

  • Subscription providers (Claude Code, Codex): Pay them directly via their websites
  • Cheap providers (GLM, MiniMax): Pay them directly, 9Router just routes
  • FREE providers (iFlow, Kiro, Qwen): Genuinely free forever, no hidden charges
  • 9Router: Never charges anything, ever

🎯 Use Cases

Case 1: “I have Claude Pro subscription”

Problem: Quota expires unused, rate limits during heavy coding

Solution:

Combo: "maximize-claude"
  1. cc/claude-opus-4-7        (use subscription fully)
  2. glm/glm-5.1               (cheap backup when quota out)
  3. kr/claude-sonnet-4.5      (free emergency fallback)

Monthly cost: $20 (subscription) + ~$5 (backup) = $25 total
vs. $20 + hitting limits = frustration

Case 2: “I want zero cost”

Problem: Can’t afford subscriptions, need reliable AI coding

Solution:

Combo: "free-forever"
  1. kr/claude-sonnet-4.5      (Claude 4.5 free unlimited)
  2. kr/glm-5                  (GLM-5 free via Kiro)
  3. oc/<auto>                 (OpenCode Free, no auth)

Monthly cost: $0
Quality: Production-ready models + RTK saves 20-40% tokens

Case 3: “I need 24/7 coding, no interruptions”

Problem: Deadlines, can’t afford downtime

Solution:

Combo: "always-on"
  1. cc/claude-opus-4-7        (best quality)
  2. cx/gpt-5.5                (second subscription)
  3. glm/glm-5.1               (cheap, resets daily)
  4. minimax/MiniMax-M2.7      (cheapest, 5h reset)
  5. kr/claude-sonnet-4.5      (free unlimited)

Result: 5 layers of fallback = zero downtime
Monthly cost: $20-200 (subscriptions) + $10-20 (backup)

Case 4: “I want FREE AI in OpenClaw”

Problem: Need AI assistant in messaging apps (WhatsApp, Telegram, Slack…), completely free

Solution:

Combo: "openclaw-free"
  1. kr/claude-sonnet-4.5      (Claude 4.5 free)
  2. kr/glm-5                  (GLM-5 free)
  3. kr/MiniMax-M2.5           (MiniMax free)

Monthly cost: $0
Access via: WhatsApp, Telegram, Slack, Discord, iMessage, Signal...

❓ Frequently Asked Questions

📊 Why does my dashboard show high costs?

The dashboard tracks your token usage and displays estimated costs as if you were using paid APIs directly. This is not actual billing - it’s a reference to show how much you’re saving by using free models or existing subscriptions through 9Router.

Example:

  • Dashboard shows: “$290 total cost”
  • Reality: You’re using iFlow (FREE unlimited)
  • Your actual cost: $0.00
  • What $290 means: Amount you saved by using free models instead of paid APIs!

The cost display is a “savings tracker” to help you understand your usage patterns and optimization opportunities.

💳 Will I be charged by 9Router?

No. 9Router is free, open-source software that runs on your own computer. It never charges you anything.

You only pay:

  • Subscription providers (Claude Code $20/mo, Codex $20-200/mo) → Pay them directly on their websites
  • Cheap providers (GLM, MiniMax) → Pay them directly, 9Router just routes your requests
  • 9Router itselfNever charges anything, ever

9Router is a local proxy/router. It doesn’t have your credit card, can’t send invoices, and has no billing system. It’s completely free software.

🆓 Are FREE providers really unlimited?

Yes! The current FREE providers (Kiro, OpenCode Free, Vertex) are genuinely free with no hidden charges.

These are free services offered by those respective companies:

  • Kiro AI: Free unlimited Claude 4.5 + GLM-5 + MiniMax via AWS Builder ID / Google / GitHub OAuth
  • OpenCode Free: No-auth passthrough proxy, models auto-fetched from opencode.ai/zen/v1/models
  • Vertex AI: $300 free credits for new Google Cloud accounts (90 days)

9Router just routes your requests to them - there’s no “catch” or future billing. They’re truly free services, and 9Router makes them easy to use with fallback support.

Discontinued free tiers (no longer recommended):

  • iFlow: Was free unlimited, now changed to paid (2026)
  • Qwen Code: Free OAuth tier discontinued by Alibaba on 2026-04-15
  • Gemini CLI: Still works, but using it with non-CLI tools (Claude, Codex, Cursor…) may result in account bans — only use if you stick to Gemini CLI itself
💰 How do I minimize my actual AI costs?

Free-First Strategy:

  1. Start with 100% free combo:

    1. gc/gemini-3-flash (180K/month free from Google)
    2. if/kimi-k2-thinking (unlimited free from iFlow)
    3. qw/qwen3-coder-plus (unlimited free from Qwen)
    

    Cost: $0/month

  2. Add cheap backup only if you need it:

    4. glm/glm-4.7 ($0.6/1M tokens)
    

    Additional cost: Only pay for what you actually use

  3. Use subscription providers last:

    • Only if you already have them
    • 9Router helps maximize their value through quota tracking

Result: Most users can operate at $0/month using only free tiers!

📈 What if my usage suddenly spikes?

9Router’s smart fallback prevents surprise charges:

Scenario: You’re on a coding sprint and blow through your quotas

Without 9Router:

  • ❌ Hit rate limit → Work stops → Frustration
  • ❌ Or: Accidentally rack up huge API bills

With 9Router:

  • ✅ Subscription hits limit → Auto-fallback to cheap tier
  • ✅ Cheap tier gets expensive → Auto-fallback to free tier
  • ✅ Never stop coding → Predictable costs

You’re in control: Set spending limits per provider in dashboard, and 9Router respects them.


📖 Setup Guide

🔐 Subscription Providers (Maximize Value)

Claude Code (Pro/Max)

Dashboard → Providers → Connect Claude Code
→ OAuth login → Auto token refresh
→ 5-hour + weekly quota tracking

Models:
  cc/claude-opus-4-7
  cc/claude-opus-4-6
  cc/claude-sonnet-4-6
  cc/claude-haiku-4-5-20251001

Pro Tip: Use Opus for complex tasks, Sonnet for speed. 9Router tracks quota per model!

OpenAI Codex (Plus/Pro)

Dashboard → Providers → Connect Codex
→ OAuth login (port 1455)
→ 5-hour + weekly reset

Models:
  cx/gpt-5.5
  cx/gpt-5.4
  cx/gpt-5.3-codex
  cx/gpt-5.2-codex

GitHub Copilot

Dashboard → Providers → Connect GitHub
→ OAuth via GitHub
→ Monthly reset (1st of month)

Models:
  gh/gpt-5.4
  gh/claude-opus-4.7
  gh/claude-sonnet-4.6
  gh/gemini-3.1-pro-preview
  gh/grok-code-fast-1

Cursor IDE

Dashboard → Providers → Connect Cursor
→ OAuth login
→ Monthly subscription

Models:
  cu/claude-4.6-opus-max
  cu/claude-4.5-sonnet-thinking
  cu/gpt-5.3-codex
💰 Cheap Providers (Backup)

GLM-5.1 / GLM-4.7 (Daily reset, $0.6/1M)

  1. Sign up: Zhipu AI
  2. Get API key from Coding Plan
  3. Dashboard → Add API Key:
    • Provider: glm
    • API Key: your-key

Use: glm/glm-5.1, glm/glm-5, glm/glm-4.7

Pro Tip: Coding Plan offers 3× quota at 1/7 cost! Reset daily 10:00 AM.

MiniMax M2.7 (5h reset, $0.20/1M)

  1. Sign up: MiniMax
  2. Get API key
  3. Dashboard → Add API Key

Use: minimax/MiniMax-M2.7, minimax/MiniMax-M2.5

Pro Tip: Cheapest option for long context (1M tokens)!

Kimi K2.5 ($9/month flat)

  1. Subscribe: Moonshot AI
  2. Get API key
  3. Dashboard → Add API Key

Use: kimi/kimi-k2.5, kimi/kimi-k2.5-thinking

Pro Tip: Fixed $9/month for 10M tokens = $0.90/1M effective cost!

🆓 FREE Providers (Recommended)

Kiro AI (Claude 4.5 + GLM-5 + MiniMax FREE)

Dashboard → Connect Kiro
→ AWS Builder ID, AWS IAM Identity Center, Google, or GitHub
→ Unlimited usage

Models:
  kr/claude-sonnet-4.5
  kr/claude-haiku-4.5
  kr/glm-5
  kr/MiniMax-M2.5
  kr/qwen3-coder-next
  kr/deepseek-3.2

Pro Tip: Best free option for Claude. No API key, no payment, fully unlimited.

OpenCode Free (No auth, auto-fetch models)

Dashboard → Connect OpenCode Free
→ No login required (passthrough proxy)
→ Models auto-fetched from opencode.ai/zen/v1/models

Pro Tip: Fastest setup. Just connect and start coding.

Vertex AI ($300 free credits for new GCP accounts)

Dashboard → Connect Vertex AI
→ Upload Google Cloud Service Account JSON
→ Enable Vertex AI API in your GCP project

Models:
  vertex/gemini-3.1-pro-preview
  vertex/gemini-3-flash-preview
  vertex/gemini-2.5-flash

Vertex Partner (Anthropic / DeepSeek / GLM / Qwen via Vertex):
  vertex-partner/glm-5-maas
  vertex-partner/deepseek-v3.2-maas
  vertex-partner/qwen3-next-80b-a3b-thinking-maas

Pro Tip: New Google Cloud accounts get $300 credits free for 90 days. Plenty for daily coding.

🎨 Create Combos

Example 1: Maximize Subscription → Cheap Backup

Dashboard → Combos → Create New

Name: premium-coding
Models:
  1. cc/claude-opus-4-7 (Subscription primary)
  2. glm/glm-5.1 (Cheap backup, $0.6/1M)
  3. minimax/MiniMax-M2.7 (Cheapest fallback, $0.20/1M)

Use in CLI: premium-coding

Monthly cost example (100M tokens):
  80M via Claude (subscription): $0 extra
  15M via GLM: $9
  5M via MiniMax: $1
  Total: $10 + your subscription

Example 2: Free-Only (Zero Cost)

Name: free-combo
Models:
  1. kr/claude-sonnet-4.5 (Claude 4.5 free unlimited)
  2. kr/glm-5 (GLM-5 free via Kiro)
  3. vertex/gemini-3.1-pro-preview ($300 free credits)

Cost: $0 forever (+ 20-40% token savings via RTK)!
🔧 CLI Integration

Cursor IDE

Settings → Models → Advanced:
  OpenAI API Base URL: http://localhost:20128/v1
  OpenAI API Key: [from 9router dashboard]
  Model: cc/claude-opus-4-7

Or use combo: premium-coding

Claude Code

Edit ~/.claude/config.json:

{
  "anthropic_api_base": "http://localhost:20128/v1",
  "anthropic_api_key": "your-9router-api-key"
}

Codex CLI

export OPENAI_BASE_URL="http://localhost:20128"
export OPENAI_API_KEY="your-9router-api-key"

codex "your prompt"

OpenClaw

Option 1 — Dashboard (recommended):

Dashboard → CLI Tools → OpenClaw → Select Model → Apply

Option 2 — Manual: Edit ~/.openclaw/openclaw.json:

{
  "agents": {
    "defaults": {
      "model": {
        "primary": "9router/kr/claude-sonnet-4.5"
      }
    }
  },
  "models": {
    "providers": {
      "9router": {
        "baseUrl": "http://127.0.0.1:20128/v1",
        "apiKey": "sk_9router",
        "api": "openai-completions",
        "models": [
          {
            "id": "kr/claude-sonnet-4.5",
            "name": "Claude Sonnet 4.5 (Kiro Free)"
          }
        ]
      }
    }
  }
}

Note: OpenClaw only works with local 9Router. Use 127.0.0.1 instead of localhost to avoid IPv6 resolution issues.

Cline / Continue / RooCode

Provider: OpenAI Compatible
Base URL: http://localhost:20128/v1
API Key: [from dashboard]
Model: cc/claude-opus-4-7
🚀 Deployment

VPS Deployment

# Clone and install
git clone https://github.com/decolua/9router.git
cd 9router
npm install
npm run build

# Configure
export JWT_SECRET="your-secure-secret-change-this"
export INITIAL_PASSWORD="your-password"
export DATA_DIR="/var/lib/9router"
export PORT="20128"
export HOSTNAME="0.0.0.0"
export NODE_ENV="production"
export NEXT_PUBLIC_BASE_URL="http://localhost:20128"
export NEXT_PUBLIC_CLOUD_URL="https://9router.com"
export API_KEY_SECRET="endpoint-proxy-api-key-secret"
export MACHINE_ID_SALT="endpoint-proxy-salt"

# Start
npm run start

# Or use PM2
npm install -g pm2
pm2 start npm --name 9router -- start
pm2 save
pm2 startup

Docker

# Build image (from repository root)
docker build -t 9router .

# Run container (command used in current setup)
docker run -d \
  --name 9router \
  -p 20128:20128 \
  --env-file /root/dev/9router/.env \
  -v 9router-data:/app/data \
  -v 9router-usage:/root/.9router \
  9router

Portable command (if you are already at repository root):

docker run -d \
  --name 9router \
  -p 20128:20128 \
  --env-file ./.env \
  -v 9router-data:/app/data \
  -v 9router-usage:/root/.9router \
  9router

Container defaults:

  • PORT=20128
  • HOSTNAME=0.0.0.0

Useful commands:

docker logs -f 9router
docker restart 9router
docker stop 9router && docker rm 9router

Environment Variables

VariableDefaultDescription
JWT_SECRET9router-default-secret-change-meJWT signing secret for dashboard auth cookie (change in production)
INITIAL_PASSWORD123456First login password when no saved hash exists
DATA_DIR~/.9routerMain app database location (db.json)
PORTframework defaultService port (20128 in examples)
HOSTNAMEframework defaultBind host (Docker defaults to 0.0.0.0)
NODE_ENVruntime defaultSet production for deploy
BASE_URLhttp://localhost:20128Server-side internal base URL used by cloud sync jobs
CLOUD_URLhttps://9router.comServer-side cloud sync endpoint base URL
NEXT_PUBLIC_BASE_URLhttp://localhost:3000Backward-compatible/public base URL (prefer BASE_URL for server runtime)
NEXT_PUBLIC_CLOUD_URLhttps://9router.comBackward-compatible/public cloud URL (prefer CLOUD_URL for server runtime)
API_KEY_SECRETendpoint-proxy-api-key-secretHMAC secret for generated API keys
MACHINE_ID_SALTendpoint-proxy-saltSalt for stable machine ID hashing
ENABLE_REQUEST_LOGSfalseEnables request/response logs under logs/
AUTH_COOKIE_SECUREfalseForce Secure auth cookie (set true behind HTTPS reverse proxy)
REQUIRE_API_KEYfalseEnforce Bearer API key on /v1/* routes (recommended for internet-exposed deploys)
HTTP_PROXY, HTTPS_PROXY, ALL_PROXY, NO_PROXYemptyOptional outbound proxy for upstream provider calls

Notes:

  • Lowercase proxy variables are also supported: http_proxy, https_proxy, all_proxy, no_proxy.
  • .env is not baked into Docker image (.dockerignore); inject runtime config with --env-file or -e.
  • On Windows, APPDATA can be used for local storage path resolution.
  • INSTANCE_NAME appears in older docs/env templates, but is currently not used at runtime.

Runtime Files and Storage

  • Main app state: ${DATA_DIR}/db.json (providers, combos, aliases, keys, settings), managed by src/lib/localDb.js.
  • Usage history and logs: ${DATA_DIR}/usage.json and ${DATA_DIR}/log.txt, managed by src/lib/usageDb.js.
  • Optional request/translator logs: <repo>/logs/... when ENABLE_REQUEST_LOGS=true.
  • Both ${DATA_DIR} and ~/.9router resolve to the same location in a Docker container — the symlink /root/.9router -> /app/data is created at build time.

📊 Available Models

View all available models

Claude Code (cc/) - Pro/Max:

  • cc/claude-opus-4-7
  • cc/claude-opus-4-6
  • cc/claude-sonnet-4-6
  • cc/claude-sonnet-4-5-20250929
  • cc/claude-haiku-4-5-20251001

Codex (cx/) - Plus/Pro:

  • cx/gpt-5.5
  • cx/gpt-5.4
  • cx/gpt-5.3-codex
  • cx/gpt-5.2-codex
  • cx/gpt-5.1-codex-max

GitHub Copilot (gh/):

  • gh/gpt-5.4
  • gh/claude-opus-4.7
  • gh/claude-sonnet-4.6
  • gh/gemini-3.1-pro-preview
  • gh/grok-code-fast-1

Cursor (cu/) - Subscription:

  • cu/claude-4.6-opus-max
  • cu/claude-4.5-sonnet-thinking
  • cu/gpt-5.3-codex
  • cu/kimi-k2.5

GLM (glm/) - $0.6/1M:

  • glm/glm-5.1
  • glm/glm-5
  • glm/glm-4.7

MiniMax (minimax/) - $0.2/1M:

  • minimax/MiniMax-M2.7
  • minimax/MiniMax-M2.5

Kimi (kimi/) - $9/mo flat:

  • kimi/kimi-k2.5
  • kimi/kimi-k2.5-thinking

Kiro (kr/) - FREE unlimited:

  • kr/claude-sonnet-4.5
  • kr/claude-haiku-4.5
  • kr/glm-5
  • kr/MiniMax-M2.5
  • kr/qwen3-coder-next
  • kr/deepseek-3.2

OpenCode Free (oc/) - FREE no-auth:

  • Auto-fetched from opencode.ai/zen/v1/models

Vertex AI (vertex/) - $300 free credits:

  • vertex/gemini-3.1-pro-preview
  • vertex/gemini-3-flash-preview
  • vertex/gemini-2.5-flash
  • vertex-partner/glm-5-maas
  • vertex-partner/deepseek-v3.2-maas

🐛 Troubleshooting

“Language model did not provide messages”

  • Provider quota exhausted → Check dashboard quota tracker
  • Solution: Use combo fallback or switch to cheaper tier

Rate limiting

  • Subscription quota out → Fallback to GLM/MiniMax
  • Add combo: cc/claude-opus-4-7 → glm/glm-5.1 → kr/claude-sonnet-4.5

OAuth token expired

  • Auto-refreshed by 9Router
  • If issues persist: Dashboard → Provider → Reconnect

High costs

  • Enable RTK in Dashboard → Endpoint settings (default ON, saves 20-40% tokens)
  • Check usage stats in Dashboard
  • Switch primary model to GLM/MiniMax
  • Use free tier (Kiro, OpenCode Free, Vertex) for non-critical tasks

Dashboard opens on wrong port

  • Set PORT=20128 and NEXT_PUBLIC_BASE_URL=http://localhost:20128

First login not working

  • Check INITIAL_PASSWORD in .env
  • If unset, fallback password is 123456

No request logs under logs/

  • Set ENABLE_REQUEST_LOGS=true

🛠️ Tech Stack

  • Runtime: Node.js 20+
  • Framework: Next.js 16
  • UI: React 19 + Tailwind CSS 4
  • Database: LowDB (JSON file-based)
  • Streaming: Server-Sent Events (SSE)
  • Auth: OAuth 2.0 (PKCE) + JWT + API Keys

📝 API Reference

Chat Completions

POST http://localhost:20128/v1/chat/completions
Authorization: Bearer your-api-key
Content-Type: application/json

{
  "model": "cc/claude-opus-4-6",
  "messages": [
    {"role": "user", "content": "Write a function to..."}
  ],
  "stream": true
}

List Models

GET http://localhost:20128/v1/models
Authorization: Bearer your-api-key

→ Returns all models + combos in OpenAI format

📧 Support


👥 Contributors

Thanks to all contributors who helped make 9Router better!

Contributors


📊 Star Chart

Star Chart

🔀 Forks

OmniRoute — A full-featured TypeScript fork of 9Router. Adds 36+ providers, 4-tier auto-fallback, multi-modal APIs (images, embeddings, audio, TTS), circuit breaker, semantic cache, LLM evaluations, and a polished dashboard. 368+ unit tests. Available via npm and Docker.


🙏 Acknowledgments

Built on the shoulders of giants:

  • CLIProxyAPI — original Go implementation that inspired this JavaScript port.
  • RTK Stars — Rust token-saver. 9Router ports its compression pipeline to JS → −20-40% input tokens on every request.
  • Caveman Stars by @JuliusBrussee — viral “why use many token when few token do trick”. 9Router adapts its prompt → −65% output tokens.

Huge thanks to these authors — without their work, 9Router’s token-saving features wouldn’t exist. ⭐ them on GitHub!


📄 License

MIT License - see LICENSE for details.


Built with ❤️ for developers who code 24/7

相似文章

decolua/9router

GitHub Trending (daily)

9router 是一款开源工具,通过将各类编程助手与多个 LLM 提供商连接,并具备自动故障转移和 Token 减少功能,从而实现无限免费 AI 编程。

@justloveabit: 用这个开源工具,我让一群AI替我上班了 事情是这样的,最近一直在折腾各种AI agent。Claude Code开一堆窗口,Codex也在跑,偶尔还要用Cursor。结果呢,乱成一锅粥——哪个agent在干啥,花了多少钱,完全搞不清楚。重…

X AI KOLs Timeline

本文介绍了一款名为Paperclip的开源工具,用于统一管理和调度多个AI Agent。它通过模拟公司组织架构、任务分配与预算控制等功能,解决了多Agent协作时上下文丢失、成本不可控和调度混乱的痛点。

@wquguru: new-api从2023年4月至今,已经有3年有余,支撑了大大小小成百上千个中转站,市面上超过90%都是用这个系统 ,而new-api的核心开发者们 @Ion_Mio_ @Seefs_ 则深藏功与名,这篇文章尝试挖掘一下new-api和另…

X AI KOLs Timeline

本文介绍了开源AI模型路由工具New-API自2023年4月发布以来的发展情况,指出其已占据市面上超90%的中转站份额,并探讨了该工具的核心开发者贡献与底层路由算法。

@WY_mask: 目前GitHub热门榜第一,4万+收藏 https://github.com/ruvnet/ruflo 一个“AI调度中心”,可以一次性召唤几十个Agent协同工作,支持多智能体协作、RAG记忆、分布式工作流,甚至还能和Claude Co…

X AI KOLs Timeline

Ruflo(前身为Claude Flow)是一个GitHub热门开源项目,支持同时调度100+专业AI Agent协同工作,具备RAG记忆、分布式工作流和企业安全功能,并可与Claude Code及Codex直接联动。该项目目前在GitHub热门榜排名第一,获得4万+收藏。