@GitTrend0x: Claude Code Codebase Smart Brain — 27× Token Savings Killer Open-Source Tool https://github.com/repowise-dev/repowise… This is Repowise, a codebase intelligence platform built for AI-assisted engineering teams! It turns...
Summary
Repowise is an open-source tool that indexes codebases into four intelligence layers (dependency graph, git history, auto-documentation, architectural decisions) and exposes them via seven MCP tools to AI coding agents like Claude Code, achieving up to 27× token savings while maintaining answer quality.
View Cached Full Text
Cached at: 05/15/26, 05:05 PM
Claude Code Codebase Intelligent Brain: 27× Token Saving, Killer Open-Source Tool.
https://github.com/repowise-dev/repowise…
This is Repowise — a codebase intelligence platform built for AI‑assisted engineering teams. It indexes your entire codebase (even multiple repositories) into four intelligence layers in one shot: dependency graph, Git history, auto‑generated documentation, and architectural decisions. Then, through seven MCP tools, it feeds directly into Claude Code or any MCP‑compatible agent — so AI no longer blindly reads entire files, but truly understands why, who changed it, where it’s dead, and how things are connected. Your codebase finally has a real brain.
repowise-dev/repowise
Source: https://github.com/repowise-dev/repowise
The codebase intelligence layer for your AI coding agent.
Four intelligence layers. Seven MCP tools. Multi-repo workspaces. Auto-sync hooks. One pip install.
PyPI version (https://pypi.org/project/repowise/) · License: AGPL v3 (https://www.gnu.org/licenses/agpl-3.0) · Python (https://pypi.org/project/repowise/) · MCP (https://modelcontextprotocol.io) · Stars (https://github.com/repowise-dev/repowise)
Explore real codebases → (https://www.repowise.dev/explore) · Hosted for teams → (https://www.repowise.dev/#contact) · Docs (https://docs.repowise.dev) · Discord (https://discord.gg/cQVpuDB6rh) · Contact
Your AI coding agent reads files. It doesn’t know which ones change together, which ones are dead, or why they were built the way they were. It has the source code and no memory of how the codebase got there.
repowise fixes that. It indexes your codebase into four intelligence layers — dependency graph, git history, auto-generated documentation, and architectural decisions — and exposes them to Claude Code (and any MCP-compatible AI agent) through seven precisely designed tools.
Multi-repo? Initialize a workspace and get cross-repo co-change detection, API contract extraction, and federated MCP queries across all your services.
27× fewer tokens per query. 36% cheaper. Same answer quality.
The result: your agent answers “why does auth work this way?” instead of “here is what auth.ts contains.”
🏆 Benchmarked against frontier LLMs
repowise-bench → (https://github.com/repowise-dev/repowise-bench) — an open SWE-QA benchmark that grades how well standard LLMs answer real software-engineering questions over real repositories.
On 48 paired tasks from
pallets/flask(claude-sonnet-4-6, end-to-end), repowise-augmented Claude Code matches baseline answer quality while being dramatically leaner:
Metric (per task, mean) Baseline + repowise Δ 💰 Cost $0.1396 $0.0890 −36 % ⚡ Wall time 41.7 s 33.9 s −19 % 🛠️ Tool calls 7.4 3.8 −49 % 📄 Files read 1.9 0.2 −89 % 32 / 48 (67 %) tasks are cheaper with repowise — at parity quality (judge Δ ≈ −0.01).
Token efficiency — because context windows aren’t free
There’s a small genre of “token efficiency” benchmarks going around. It would be impolite not to contribute one. Ours runs on the 30 most recent non-merge commits of pallets/flask and asks one question: to understand a commit, how many tokens does each strategy ask the model to read?
| Strategy | Tokens / commit |
|---|---|
| Naive (full contents of changed files) | 64,039 |
git diff only | 14,888 |
repowise get_context | 2,391 |
209× less than naive (mean), 26.8× pooled, 1,214× best case. 41.7× less than git diff (mean), 6.2× pooled. Same file list, same tokenizer (cl100k_base), no per-strategy fudge. We report mean, pooled, and median together because picking just one would be the kind of thing other people in this genre seem to do.
Full methodology, per-task tables, and the actual SWE-QA evaluation (which has third-party ground truth and an independently-scored LLM judge — unlike this sanity-check): repowise-bench → (https://github.com/repowise-dev/repowise-bench)
What repowise builds
repowise runs once, builds everything, then keeps it in sync on every commit.
◈ Graph Intelligence
tree-sitter parses every file across 14 languages into a two-tier dependency graph — file nodes and symbol nodes (functions, classes, methods). A 3-tier call resolver with confidence scoring handles import aliases, barrel re-exports, and namespace imports. Heritage extraction covers extends, implements, trait impls, derive macros, mixins, and extension conformance. Leiden community detection finds logical modules even when your directory structure doesn’t reflect them. PageRank, betweenness centrality, SCC analysis, and execution flow tracing from entry points identify your most central, most coupled, and most traversed code.
◈ Git Intelligence
Your git history turned into signals: hotspot files (high churn × high complexity), ownership percentages per author, co-change pairs (files that change together without an import link — hidden coupling), and significant commit messages that explain why code evolved.
Rolled up into contributor profiles (per-author module rollups, top files, co-authors, silo modules, dead-code burden), module health scorecards (composite score over churn × ownership × docs × dead code × bus factor), and reviewer suggestions for any PR file list, weighted by direct authorship, co-change history, and recency.
◈ Documentation Intelligence
An LLM-generated wiki for every module and file, rebuilt incrementally on every commit. Coverage tracking. Freshness scoring per page. Semantic search via RAG. Confidence scores show how current each page is relative to the underlying code.
◈ Decision Intelligence
The layer nobody else has. Architectural decisions captured from git history, inline markers, and explicit CLI — linked to the graph nodes they govern, tracked for staleness as code evolves.
# WHY: JWT chosen over sessions — API must be stateless for k8s horizontal scaling
# DECISION: All external API calls wrapped in CircuitBreaker after payment provider outages
# TRADEOFF: Accepted eventual consistency in preferences for write throughput
These become structured decision records, queryable by Claude Code via get_why().
Quickstart
pip install repowise
Or install the CLI into an isolated, uv-managed environment:
uv tool install repowise
Single repo
cd your-project
repowise init # builds all four intelligence layers (~25 min first time)
repowise serve # starts MCP server + local dashboard
Multi-repo workspace
cd my-workspace/ # parent dir containing backend/, frontend/, shared-libs/
repowise init . # scans for git repos, indexes each, runs cross-repo analysis
repowise serve # workspace dashboard + per-repo pages
That’s it. repowise init automatically registers the MCP server, installs PreToolUse/PostToolUse hooks in ~/.claude/settings.json, generates .mcp.json at the project root, and offers to install a post-commit git hook that keeps everything in sync after every commit. See Auto-Sync for all sync methods (hooks, file watcher, GitHub/GitLab webhooks, polling).
To manually add the MCP server to another editor:
{
"mcpServers": {
"repowise": {
"command": "repowise",
"args": ["mcp", "/path/to/your/project"]
}
}
}
Note on init time: Initial indexing analyzes your entire codebase — AST parsing, git-history mining, LLM doc generation, embedding indexing, and decision archaeology. This is a one-time cost (~25 minutes for a 3,000-file project). Every subsequent update after a commit takes under 30 seconds and only regenerates the few pages affected by your changes.
Full docs: Quickstart · User Guide · CLI Reference · MCP Tools · Workspaces · Computed Glossary · Auto-Sync
Workspaces — multi-repo intelligence
Most codebases aren’t one repo. repowise workspaces let you index and query multiple repositories together — with cross-repo intelligence that single-repo tools can’t provide.
cd my-workspace/ # backend/, frontend/, shared-libs/ under one parent
repowise init . # scan, select repos, index each, run cross-repo analysis
What you get on top of per-repo intelligence:
| Feature | What it does |
|---|---|
| Cross-repo co-changes | Finds files across repos that change in the same time window — e.g., backend/api/routes.py and frontend/src/api/client.ts always move together |
| API contract extraction | Scans for HTTP route handlers (Express, FastAPI, Spring, Go), gRPC service defs, and message topic publishers/subscribers — then matches providers with consumers across repos |
| Package dependency mapping | Reads package.json, pyproject.toml, go.mod, pom.xml to detect when one repo depends on another as a package |
| Federated MCP queries | One MCP server serves all repos. Pass repo="backend" or repo="all" to any tool |
| Workspace dashboard | Aggregate stats, repo cards, contract links, co-change pairs — all in the web UI |
| Workspace CLAUDE.md | Auto-generated context file covering all repos, their relationships, and cross-repo signals |
Workspace CLI:
repowise workspace list # show all repos and their status
repowise workspace add ../new-service # add a repo (auto-indexes + docs by default)
repowise workspace remove api-gateway # remove a repo (doesn't delete files)
repowise workspace scan # re-scan for new repos
repowise update --workspace # update all stale repos + first-time index any new ones
repowise update --repo backend # scope to one repo (auto-detected from cwd too)
repowise watch --workspace # auto-update all repos on file change
repowise doctor --workspace --repair # validate every repo; sync state drift; drop dead entries
repowise hook install --workspace # install post-commit hooks for all repos
Most commands also accept --no-workspace to force single-repo mode and --repo <name> to scope to one repo. See CLI Reference.
Full guide: docs/WORKSPACES.md
Seven MCP tools
Most tools are designed around data entities — one module, one file, one symbol — which forces AI agents into long chains of sequential calls. repowise tools are designed around tasks. Pass multiple targets in one call. Get complete context back.
Full reference: docs/MCP_TOOLS.md
| Tool | What it answers | When Claude Code calls it |
|---|---|---|
get_overview() | Architecture summary, module map, entry points, git health, community summary | First call on any unfamiliar codebase |
get_answer(question) | One-call RAG: retrieves over the wiki, gates on confidence, and synthesizes a cited 2–5 sentence answer. High-confidence answers cite directly; ambiguous queries return ranked excerpts. | First call on any code question — collapses search → read → reason into one round-trip |
get_context(targets, include?) | The workhorse. Docs, symbols, ownership, freshness, community membership for any targets. include options: "source" (symbol body), "callers"/"callees" (call graph), "metrics" (PageRank, centrality), "community" (cluster membership). Batch multiple targets. In workspace mode, pass repo to target a specific repo. | Before reading or modifying code. Pass all relevant targets in one call. |
search_codebase(query) | Semantic search over the full wiki. Natural language. In workspace mode, searches across all repos. | When get_answer returned low confidence and you need to discover candidate pages by topic |
get_risk(targets?, changed_files?) | Hotspot scores, dependents, co-change partners, blast radius, recommended reviewers, test gaps, security signals, 0–10 risk score | Before modifying files — understand what could break |
get_why(query?) | Three modes: NL search over decisions · path-based decisions for a file · no-arg health dashboard | Before architectural changes — understand existing intent |
get_dead_code(min_confidence?, include_internals?) | Unreachable code sorted by confidence tier with cleanup impact estimates | Cleanup tasks |
Tool call comparison — a real task
“Add rate limiting to all API endpoints.”
| Approach | Tool calls | Time to first change | What it misses |
|---|---|---|---|
| Claude Code alone (no MCP) | grep + read ~30 files | ~8 min | Ownership, prior decisions, hidden coupling |
| repowise (7 tools) | 5 calls | ~2 min | Nothing |
The 5 calls for that task:
get_overview() # orient: understand the architecture
get_context(["middleware", "api/routes", "payments"]) # understand 3 modules at once
get_risk(["middleware/auth.ts"]) # assess: 47 dependents, co-changes
get_why("rate limiting") # check: any prior decision?
search_codebase("rate limit OR throttle OR retry") # find: any prior implementation?
How Claude Code uses it
User: Implement rate limiting on all API endpoints
Claude Code:
→ get_overview()
"Express API. Entry points in api/routes/. Middleware in middleware/."
→ get_context(["middleware", "api/routes", "payments"])
middleware/: existing chain is cors → auth → routes. Owner: @alex.
api/routes/: 23 route files. No existing rate limiting.
payments/: Owner @sarah (71%). Decision: all side effects must be idempotent.
→ get_why("rate limiting")
"No prior decision found. No prior implementation detected."
→ get_risk(["middleware/auth.ts"])
"47 files import this. Co-changes with all 4 service listeners.
Risk summary: any interface change here touches 47 dependents."
→ search_codebase("rate limit throttle retry")
"Found: payments/retry.ts already has RetryQueue class.
Found: payments/middleware.ts has idempotency key middleware."
Implementing rate-limiting middleware, inserting after cors, before auth. Will also update tests/middleware.test.ts — detected as historical co-change partner. Flagging payments/ for @sarah review — hotspot, high ownership concentration.
This is what happens when an AI agent has real codebase intelligence.
Local dashboard
repowise serve starts a full web UI alongside the MCP server. No separate setup — browse your codebase intelligence directly in the browser.
| View | What it shows |
|---|---|
| Chat | Ask anything about your codebase in natural language |
| Docs | AI-generated wiki with syntax highlighting, Mermaid diagrams, and a graph intelligence sidebar showing PageRank/betweenness percentiles, community membership, and degree |
| Graph | Interactive dependency graph — handles 2,000+ nodes. Community color mode with real labels, community detail panel on click, path finder |
| Search | Full-text and semantic search with global command palette (Ctrl+K) |
| Symbols | Searchable index of every function, class, and method, server-ranked by importance (PageRank × visibility × complexity × kind × entry-point). Filter facets for public-only, language, kind, in-hotspot-files, and in-entry-point-files. Click any symbol for graph metrics, callers/callees, class heritage, and a git panel with governing decisions |
| Coverage | Doc freshness per file with one-click regeneration |
| Risk | One page, six tabs: Hotspots (with inline drill-down to the top importance-ranked symbols in each file), Heatmap (ownership treemap with bus-factor borders and silo overlay), Module Health, Dead Code, Impact (blast radius), and the always-visible risk strip linking to each |
| Contributors | Paginated contributor directory and per-author profile pages — modules they own, top files, co-authors, commit category mix, silo modules, bus-factor risk files, and dead-code burden. Click any owner anywhere in the dashboard to drill in |
| Module Health | Per-module rollup with a composite health score (churn / ownership / docs / dead code / bus factor), sortable list, and a per-module detail page showing owners, top hotspots, and governing decisions |
| Hotspots | Ranked by trend-weighted score (180-day decay) and churn. Paginated load-more; each row expands inline to the most important symbols in that file |
| Dead Code |
Similar Articles
@GitTrend0x: Reduce Claude Code tool calls by 94% with this local code knowledge graph powerhouse: https://github.com/Jakedismo/codegraph-rust… This is CodeGraph-Rust, an MCP server that proactively indexes your entire codebase into…
CodeGraph-Rust is a tool based on the MCP protocol that indexes codebases into a local knowledge graph. It claims to reduce tool calls by 94% for AI agents like Claude Code, significantly boosting speed and saving tokens.
@DeRonin_: Do you understand what Repowise just open-sourced??? 50% less cost and 4x faster by giving your AI a map of your codeba…
Repowise is an open-source tool that indexes codebases into four intelligence layers (AST, dependency graph, git history, and LLM-generated wiki) and exposes them as MCP tools, enabling AI coding agents like Claude Code to understand code architecture and history, reducing costs by 50% and speeding up tasks by 4x.
@VincentLogic: This open-source project cuts Claude Code's costs by 25%. It doesn't build new models or a new IDE. It just draws a "code map" for the AI coding agent. Traditional approach: the model reads the entire repo → token explosion. Its approach: first parse the code with Tree-si…
An open-source project uses Tree-sitter to parse code into a graph structure and store it in local SQLite, providing a code map for AI coding agents, thereby reducing token consumption and costs. On average, it saves 57% tokens and reduces costs by 25%. Supports tools like Claude Code, Cursor, aider, etc.
@justloveabit: https://x.com/justloveabit/status/2055263377006747820
Introducing the new version of Claude Code 2.1.142 in combination with CodeGraph and MCP, which greatly improves the efficiency of exploring large codebases through a local semantic knowledge graph, with a 92% reduction in tool calls and a 71% speed improvement.
@GitHub_Daily: When developing a project with Claude Code, if the codebase is large, every exploration of the code structure requires scanning a bunch of files, resulting in many tool calls, slow speed, and heavy token usage. So I found CodeGraph, an open-source tool that pre-builds a semantic knowledge graph for the codebase, allowing Claude Code to query the graph directly instead of scanning files one by one...
CodeGraph is an open-source tool that pre-builds a semantic knowledge graph for codebases, allowing Claude Code to query the graph instead of scanning files one by one, thereby significantly reducing tool calls (by 92%) and improving exploration speed (by 71%). It supports 19 programming languages and 13 frameworks.