A VERY lightweight open web-search tool for smaller local LLMs

Reddit r/LocalLLaMA Tools

Summary

TinySearch is an open-source MCP tool that performs web search, crawls pages, and retrieves/re-ranks relevant chunks to provide a concise context for smaller local LLMs, avoiding context pollution from full page dumps.

Hey everyone, Been playing around with local agent setups lately, mostly Cline/Roo with smaller models, and web search kept annoying me. Not because it doesn’t work, but because it usually throws way too much random page text into the context. small models really don’t handle that gracefully lol. they start with a simple search and suddenly half the prompt is scraped garbage. So I built bad boy, TinySearch. It’s a small open-source MCP tool that does web search, crawls a few pages, chunks/retrieves/reranks the useful bits, and gives the agent a much smaller context blob instead of dumping full pages. Repo: [https://github.com/MarcellM01/TinySearch](https://github.com/MarcellM01/TinySearch) Uses DuckDuckGo, Crawl4AI, dense + BM25-style retrieval, reranking, MCP, and it can also run as a FastAPI server. On my setup (M4 Mac and old ahh lenovo thinkpad) it usually takes around 5–12 seconds end to end, depending on the query/machine Not trying to replace real search infra or anything. it’s more just a little local research layer for people building agents who don’t want to spin up a whole backend just to let the model look stuff up. Still rough in places, but it’s been useful enough for my own workflows that I figured I’d share it. Feedback/roasting welcome, especially from people using Cline, Roo, MCP, or smaller local models.
Original Article

Similar Articles

Markdown browser for LLMs

Reddit r/LocalLLaMA

The author introduces TextWeb, an open-source tool that renders web pages as markdown for LLMs instead of using expensive vision models, featuring CLI and MCP server support.

LearningCircuit/local-deep-research

GitHub Trending (daily)

A privacy-focused local deep research tool that supports various LLMs and search engines to achieve high accuracy on QA tasks while keeping data encrypted and local.