line-level-coverage

#line-level-coverage

SWE-Explore: Benchmarking How Coding Agents Explore Repositories

Hugging Face Daily Papers ↗ · 2026-06-05 Cached

SWE-Explore introduces a benchmark for evaluating coding agents' repository exploration capabilities, requiring ranked lists of relevant code regions within line budgets. Experiments show agentic exploration outperforms traditional retrieval, and line-level coverage remains a key differentiator.

0 favorites 0 likes

line-level-coverage

SWE-Explore: Benchmarking How Coding Agents Explore Repositories

Submit Feedback