bm25

#bm25

@alex_prompter: My agents kept getting dumber every time I gave them more tools. The reason is mechanical. Every MCP server you connect…

X AI KOLs Timeline ↗ · 6d ago Cached

Ratel is an open-source tool that reduces input tokens by 79% and improves tool selection accuracy for AI agents by loading only needed tools using a BM25 index, instead of all available tools.

0 favorites 0 likes

#bm25

AutoIndex: Learning Representation Programs for Retrieval

Hugging Face Daily Papers ↗ · 2026-07-21 Cached

AutoIndex is a framework that learns executable representation programs to transform documents before indexing, improving retrieval quality over static BM25 baselines by average 8.4% Recall@100 and 8.3% nDCG@10.

0 favorites 0 likes

#bm25

I benchmarked my reasoning-based retrieval system against FAISS and BM25 on 700 queries, running everything on local Qwen. Results + where it loses

Reddit r/AI_Agents ↗ · 2026-07-09

A reasoning-based retrieval system (ClawIndex) is benchmarked against FAISS and BM25 on 700 queries using local Qwen, achieving higher NDCG@10 but slower speed and lower MRR on some datasets, with the author seeking feedback.

0 favorites 0 likes

#bm25

@johnsonshi86: https://x.com/johnsonshi86/status/2072112215097024961

X AI KOLs Timeline ↗ · 2026-07-01 Cached

Describes DR-DCI, an optimization that combines RAG with bash commands on a virtual filesystem to enable agents to perform precise corpus retrieval, and discusses scaling to distributed systems for inference providers.

0 favorites 0 likes

#bm25

How Lume Works: The Retrieval Primitives

Hacker News Top ↗ · 2026-06-20 Cached

Technical deep dive into Lume, a Rust hybrid search engine that combines BM25, dense vectors, and entity graphs for auditable, local-first retrieval.

0 favorites 0 likes

#bm25

When Rules Learn: A Self-Evolving Agent for Legal Case Retrieval

arXiv cs.AI ↗ · 2026-06-17 Cached

This paper introduces a self-evolving framework that uses an LLM-based agent to iteratively create and refine query rewriting rules for BM25 in legal case retrieval, outperforming non-evolutionary baselines on the LeCaRD-v2 benchmark without any parameter training.

0 favorites 0 likes

#bm25

Hybrid retrieval + dependency-graph expansion beats embeddings-only for code RAG — measured, CI-gated

Reddit r/AI_Agents ↗ · 2026-06-16

Archex is a new open-source code RAG tool that improves retrieval by combining hybrid search (BM25F + dense embeddings), cross-encoder reranking, and dependency-graph expansion, achieving much higher recall and token efficiency than pure embeddings-based approaches.

0 favorites 0 likes

#bm25

Why I stopped using semantic embeddings for tool selection and switched back to BM25 [D]

Reddit r/MachineLearning ↗ · 2026-06-08

The author shares their experience switching from semantic embeddings to BM25 for tool selection in agents, finding that BM25 achieves 81% top-1 accuracy vs. 64% for embeddings on a corpus of 200 query-tool pairs, because tool descriptions are short and keyword-driven rather than semantically rich like documents.

0 favorites 0 likes

#bm25

your RAG app isn't broken because of the model

Reddit r/ArtificialInteligence ↗ · 2026-06-08

A developer shares that the real problem in a RAG app was the retrieval step failing on version numbers and codes, fixed by hybrid search (vector + BM25 + reciprocal rank fusion), not the model.

0 favorites 0 likes

#bm25

Towards Retrieving Interaction Spaces for Agentic Search

Hugging Face Daily Papers ↗ · 2026-06-05 Cached

RISE framework constructs bounded interaction spaces for agentic search by combining BM25 retrieval with preprocessed document indexing, enabling efficient corpus exploration while maintaining high accuracy at scale.

0 favorites 0 likes

#bm25

Training-Free Lexical-Dense Fusion for Conversational-Memory Retrieval

arXiv cs.LG ↗ · 2026-06-04 Cached

This paper proposes a training-free, CPU-only retrieval method that fuses BM25 lexical scores with late-interaction dense scores for conversational memory retrieval, achieving up to +17.2 points improvement on LoCoMo Hit@1 over late interaction alone across six encoders. The study provides controlled ablations on pooling operators, reranker effects, and benchmark robustness, framing the gain as a division of labor between dense and lexical signals.

0 favorites 0 likes

#bm25

is [ BM25 + vector ]+ RRF really worth it?

Reddit r/AI_Agents ↗ · 2026-06-03

This post questions whether combining BM25 and vector search with RRF improves hit rates in agentic memory retrieval, suggesting BM25 alone may suffice.

0 favorites 0 likes

#bm25

@mixedbreadai: By now, everyone knows that single-vector embedding models are hugely limiting for modern workflows. But they contain t…

X AI KOLs Following ↗ · 2026-06-02 Cached

Single-vector embedding models can be used to extract sparse latent terms, and BM25 can turn this vocabulary into a strong retriever.

0 favorites 0 likes

#bm25

spent way too long debugging RAG before realizing the chunking was the problem the whole time

Reddit r/ArtificialInteligence ↗ · 2026-06-01

A developer recounts debugging RAG systems, discovering that fixed-size chunking breaks sentence boundaries, vector search fails for exact identifiers (solved with BM25), and stale indexes cause confident wrong answers.

0 favorites 0 likes

#bm25

@bclavie: Very excited to finally share this one after sitting on it for far too long! It's very topical now. Blog post coming ve…

X AI KOLs Timeline ↗ · 2026-05-30 Cached

Researchers extract indexable, BM25-ready sparse features from frozen dense retrievers using reconstruction-trained sparse autoencoders.

0 favorites 0 likes

#bm25

@_reachsumit: Latent Terms: Dense Retrievers Contain Trivially Extractable BM25-ready Zipfian Vocabularies @bclavie et al. extract in…

X AI KOLs Following ↗ · 2026-05-29 Cached

The paper proposes Latent Terms, a method using Sparse Autoencoders to extract BM25-ready sparse features from frozen dense retrievers, achieving competitive performance without retrieval-specific training.

0 favorites 0 likes

#bm25

@jerryjliu0: Real question: what is the actual latest state-of-the-art for file search and retrieval? - Actual grep over filesystem …

X AI KOLs Following ↗ · 2026-05-18 Cached

Jerry Liu asks about the current state-of-the-art for file search and retrieval, listing options from grep to hybrid search over a database.

0 favorites 0 likes

#bm25

@rwayne: Context Mode solves the other half of AI Agent context issues: sandboxed tool outputs + persistent sessions. A 56 KB Playwright snapshot compressed to 299 bytes, 98% of data never entering the context. Every file edit, Git operation, task decision is stored into…

X AI KOLs Timeline ↗ · 2026-05-12 Cached

Context Mode is a tool that solves AI agent context problems by sandboxing tool outputs and persisting sessions, achieving up to 98% compression of Playwright snapshots and using BM25 retrieval to reduce context window usage. It supports 15 platforms including Claude Code, Gemini CLI, VS Code Copilot, and is used by major tech companies.

0 favorites 0 likes

#bm25

Rethinking Agentic Search with Pi-Serini: Is Lexical Retrieval Sufficient?

Hugging Face Daily Papers ↗ · 2026-05-11 Cached

This paper introduces Pi-Serini, a BM25-based agentic search system that demonstrates lexical retrieval can suffice for deep search when agents refine queries, achieving high accuracy and reducing costs compared to default settings.

0 favorites 0 likes

#bm25

@DailyDoseOfDS_: Stop using vector search everywhere! A 30-year-old algorithm with zero training, zero embeddings, and zero fine-tuning …

X AI KOLs Timeline ↗ · 2026-05-07

The article argues against overusing vector search, highlighting BM25's effectiveness for exact keyword matching and its role in hybrid search systems.

0 favorites 0 likes

bm25

Submit Feedback