code-search

#code-search

@geekbb: A CLI tool written in Go that integrates three search capabilities: Web search (Brave/DDG/SearXNG/Exa), code search (Grep/Sourcegraph/GitHub), and library documentation query (Context7). It also supports web scraping and site crawling. For AI...

X AI KOLs Timeline ↗ · 4d ago Cached

A blazing-fast, stateless CLI tool written in Go that integrates Web search, code search, and library documentation query. It supports web scraping and site crawling, designed for AI agents and terminal use.

0 favorites 0 likes

#code-search

@Ryrenz: The token-saving artifact for coding agents has arrived—cocoindex-code, one command for semantic search on your codebase. Just open-sourced, quickly gaining stars with the selling point of "saving 70% tokens". Enabling agents like Claude Code, Codex…

X AI KOLs Timeline ↗ · 4d ago Cached

cocoindex-code is an AST-based semantic code search tool that can be quickly integrated into coding agents, saving up to 70% tokens and improving search efficiency.

0 favorites 0 likes

#code-search

Recall Before Rerank: Benchmarking Deep Learning Models for Large-Scale Code-to-Code Retrieval

arXiv cs.CL ↗ · 5d ago Cached

This paper benchmarks 17 deep learning models for first-stage recall in large-scale code-to-code retrieval, evaluating their precision, efficiency, and scalability across multiple programming languages and datasets. It introduces LLM-based code normalization and query rewriting schemes that improve precision for lower-performing models.

0 favorites 0 likes

#code-search

@Chenzeze777: Guys, I was totally stunned scrolling through GitHub today. Headroom gained 14k stars in a week, absolutely blowing up in the overseas developer circle. I initially thought it was just another PPT open-source project, but after a close look at the real-world test data—code search compressed from 17k tokens to 1,400, with the answer unchanged word for word. Let me...

X AI KOLs Timeline ↗ · 2026-06-08 Cached

Headroom is an open-source tool that compresses token usage in code search results and AI conversations by up to 92% (e.g., from 17k to 1,400 tokens) while maintaining answer quality. It supports multiple platforms and runs locally for free.

0 favorites 0 likes

#code-search

I built an open-source coding agent that makes context visible and editable — you curate exactly what the LLM sees

Reddit r/AI_Agents ↗ · 2026-05-31

The author built Nice Coding Agent, an open-source coding workbench with a visible and editable context stack, allowing users to curate exactly what the LLM sees. It features local-first retrieval, sandboxed execution, and hybrid code search, aiming to give developers control and visibility over context assembly.

0 favorites 0 likes

#code-search

@Trtd6Trtd: https://github.com/MinishLab/semble… High-speed code search library specialized for AI Compared to grep + reading, it s…

X AI KOLs Timeline ↗ · 2026-05-20 Cached

Semble 是一个面向 AI 代理的高效代码搜索库，使用模型如 Model2Vec 或 BM25 实现快速索引和检索，比 grep+read 节省约 98% 的 token，支持 MCP 服务器和 CLI 集成。

0 favorites 0 likes

#code-search

@aigclink: An Agent-oriented code search tool: Semble. It uses natural language to search codebases and returns precise code snippets, saving 98% token consumption compared to grep+read. The method lets Agents use natural language to directly locate the most relevant lines of code, without guessing keywords or reading entire files. Speed: indexing a typical…

X AI KOLs Timeline ↗ · 2026-05-19 Cached

Semble is an Agent-oriented code search tool that supports natural language queries, accurately returns semantically complete code snippets, saves 98% token consumption compared to traditional grep+read methods, and features intelligent chunking, dual-path retrieval, and code-aware re-ranking.

0 favorites 0 likes

#code-search

Built a local-first context engine for AI coding agents — symbol graph + semantic search, no cloud

Reddit r/artificial ↗ · 2026-05-18

Argyph is an open-source MCP server that provides AI coding agents with structured codebase understanding via a symbol graph and semantic search, running entirely locally with no cloud dependencies.

0 favorites 0 likes

#code-search

Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep

Hacker News Top ↗ · 2026-05-17 Cached

Semble is a fast code search library for AI agents that uses ~98% fewer tokens than grep+read, runs on CPU with no external dependencies, and integrates via MCP or CLI.

0 favorites 0 likes

#code-search

Beyond Retrieval: A Multitask Benchmark and Model for Code Search

Hugging Face Daily Papers ↗ · 2026-05-06 Cached

This paper introduces CoREB, a contamination-limited multitask benchmark for code search that evaluates text-to-code, code-to-text, and code-to-code retrieval with fine-tuned reranking capabilities.

0 favorites 0 likes

#code-search

New and improved embedding model

OpenAI Blog ↗ · 2022-12-15 Cached

OpenAI released text-embedding-ada-002, a unified embedding model that consolidates five previous models into one with superior performance, 4x longer context (8192 tokens), smaller dimensionality (1536), and 99.8% lower pricing than previous Davinci embeddings.

0 favorites 0 likes

#code-search

Introducing text and code embeddings

OpenAI Blog ↗ · 2022-01-25 Cached

OpenAI introduces a new embeddings API endpoint that converts text and code into numerical vector representations for semantic search, clustering, and classification tasks. The models achieve state-of-the-art results on standard benchmarks including a 20% relative improvement in code search performance.

0 favorites 0 likes

#code-search

Text and code embeddings by contrastive pre-training

OpenAI Blog ↗ · 2022-01-24 Cached

OpenAI presents a contrastive pre-training approach for generating high-quality text and code embeddings at scale without supervision, achieving state-of-the-art results on linear-probe classification, semantic search, and code search benchmarks.

0 favorites 0 likes

#code-search

@sourcebot_dev: Give agents context across your entire codebase. Open source, 1 min install.

X AI KOLs Following ↗ · 2026-04-21 Cached

Sourcebot has launched an open-source MCP (Model Context Protocol) server that connects AI coding agents like Cursor, Claude Code, and Copilot to an entire codebase for search, file reading, and reference resolution. It supports OAuth 2.0 and API key authorization with a quick 1-minute install.

0 favorites 0 likes

code-search

Submit Feedback