Why Vector RAG fails for AI coding agents at scale (And how I used a Neo4j graph to fix it)
Summary
A new open-source tool called Writ uses a hybrid retrieval pipeline with BM25, ONNX vectors, and Neo4j graph traversals to provide context rules for AI coding agents, reducing token bloat by 726x and enforcing plan approval via bash hooks.
Similar Articles
How I wired a Graph DB on top of my vector store to scale 1K agents for 2 months, because vector search alone fails when user preferences change over time.
A detailed architectural guide for building long-running AI agents that handle changing user preferences over time by combining a vector store, graph DB, and temporal edges instead of overwriting data.
@techwith_ram: Your AI stack has a database problem You need a vector DB for embeddings. A graph DB for relationships. An application …
HelixDB is a new open-source database built in Rust that combines vector, graph, and other data models into a single engine, backed by Y Combinator. It aims to replace separate vector, graph, and application databases for AI stacks, offering native vector search, graph traversal, and MCP support.
We tried vectors, ASTs, and brute-force context stuffing for code retrieval. Graphs with LLM-generated semantics worked best. Here's what we learned.
The authors detail their experience building a code indexing system, concluding that graph-based retrieval with LLM-generated semantics outperforms vector embeddings and pure AST parsing. They open-sourced the system, Bytebell, which uses Neo4j to store semantic context for efficient and precise code retrieval.
Building Agentic GraphRAG Systems: From knowledge graphs and ontologies to a unified memory as an MCP server for your AI agent.
The author argues that GraphRAG is fundamentally a data modeling problem rather than just a retrieval algorithm, proposing a five-component architecture using ontologies, knowledge graphs, and an MCP server for unified agent memory.
We built and open-sourced Caliby: An embedded, high-performance vector database for AI Agents (Beats pgvector by 4x, outperforms FAISS on disk)
Caliby is an open-sourced embedded vector database co-developed by Sea-Land AI and MIT's Michael Stonebraker team, offering high-performance vector retrieval (4x faster than pgvector) with HNSW, DiskANN, and IVF+PQ indexes, designed specifically for AI Agent and RAG use cases with a simple pip install.