retrieval-augmented

#retrieval-augmented

ContextSniper: AntTrail's Token-Efficient Code Memory for Repository-Level Program Repair

arXiv cs.AI ↗ · 2d ago Cached

ContextSniper is a token-efficient code memory layer for repository-level program repair using LLM agents. It reduces token usage by up to 51.5% and cost by up to 36.4% while maintaining similar resolution rates on SWE-bench Lite.

0 favorites 0 likes

#retrieval-augmented

Verifiable Knowledge Expansion through Retrieval-Grounded Formal Concept Analysis

arXiv cs.AI ↗ · 2d ago Cached

This paper proposes a retrieval-grounded small language model framework that uses formal concept analysis as a symbolic verification loop for ontology construction, demonstrating its effectiveness in a rare ataxia setting.

0 favorites 0 likes

#retrieval-augmented

Auditing Forgetting in Limited Memory Language Models

arXiv cs.CL ↗ · 3d ago Cached

This paper proposes a causal auditing framework to evaluate forgetting in Limited Memory Language Models by varying the database state during inference, discovering that parametric leakage is negligible and post-deletion correctness primarily arises from retrieval artifacts rather than residual parametric memory.

0 favorites 0 likes

#retrieval-augmented

HistoriQA-ThirdRepublic: Multi-Hop Question Answering Corpus for Historical Research, Parliamentary Debates from the French Third Republic (1870-1940)

arXiv cs.AI ↗ · 4d ago Cached

This paper presents HistoriQA-ThirdRepublic, a French-language multi-hop question answering dataset derived from historical documents of the French Third Republic, designed to evaluate retrieval-augmented and LLM systems in historical research contexts.

0 favorites 0 likes

#retrieval-augmented

Retrieval-Warmed Energy-Based Reasoning: A Five-Arm Ablation Methodology for Diffusion-as-Inference on Structured Reasoning Tasks

arXiv cs.LG ↗ · 2026-06-26 Cached

This paper presents a five-arm ablation methodology for diagnosing which component of retrieval-warmed energy-based reasoning (RW-EBR) drives performance gains, applied to structured reasoning tasks like graph reachability and Sudoku. The method separates effects of class-prior bias, stochastic warm-starting, and graph-aligned value reuse.

0 favorites 0 likes

#retrieval-augmented

@yibie: Recommend this article. The teams from SJTU and Tsinghua systematically evaluated 12 agent memory systems. It's not one of those "our model is better" papers but rather breaks down how to choose memory systems from a data management perspective—when to use RAG, when to use vector databases, when to use knowledge graphs. Long-term memory for agents...

X AI KOLs Timeline ↗ · 2026-06-26 Cached

This paper from SJTU and Tsinghua systematically evaluates 12 agent memory systems from a data management perspective, decomposing memory into four modules and providing guidelines on when to use RAG, vector databases, or knowledge graphs for long-term agent memory.

0 favorites 0 likes

#retrieval-augmented

Retrieval-Augmented Personalization with Foundation Models for Wearable Stress Detection

arXiv cs.LG ↗ · 2026-06-25 Cached

This paper introduces a retrieval-augmented personalization method for wearable stress detection using frozen foundation models, achieving near-supervised fine-tuning performance without requiring labeled user data.

0 favorites 0 likes

#retrieval-augmented

RASC+: Retrieval-Constrained LLM Adjudication for Clinical Value Set Authoring

arXiv cs.CL ↗ · 2026-06-24 Cached

This paper introduces RASC+, a retrieval-constrained LLM adjudication method for clinical value set authoring that improves candidate-pool recall and selection precision over prior RASC baselines, demonstrating that blinded LLM adjudication with Qwen3-based retrieval significantly outperforms direct generation.

0 favorites 0 likes

#retrieval-augmented

ScaffoldAgent: Utility-Guided Dynamic Outline Optimization for Open-Ended Deep Research

arXiv cs.AI ↗ · 2026-06-20 Cached

ScaffoldAgent introduces a utility-guided dynamic outline optimization framework for open-ended deep research, using expansion, contraction, and revision operations to improve long-form report generation and factual grounding.

0 favorites 0 likes

#retrieval-augmented

Multi-Agent Transactive Memory

arXiv cs.AI ↗ · 2026-06-20 Cached

Proposes Multi-Agent Transactive Memory (MATM), a framework for population-level storage and retrieval of agent-generated trajectories to improve task performance and reduce interaction steps in interactive environments like ALFWorld and WebArena.

0 favorites 0 likes

#retrieval-augmented

Compositional Skill Routing for LLM Agents: Decompose, Retrieve, and Compose

arXiv cs.CL ↗ · 2026-06-17 Cached

Introduces SkillWeaver, a decompose-retrieve-compose framework for routing multiple skills to LLM agents, along with CompSkillBench, a benchmark of 300 compositional queries over 2,209 real MCP server skills.

0 favorites 0 likes

#retrieval-augmented

An Agentic Retrieval Framework for Autonomous Context-Aware Data Quality Assessment

arXiv cs.AI ↗ · 2026-06-15 Cached

A research paper proposing a unified agentic-retrieval framework for autonomous context-aware data quality assessment. It interprets natural-language usage descriptions, generates executable validation logic via multi-agent workflow, and uses feasibility validation to ensure reliability.

0 favorites 0 likes

#retrieval-augmented

DRIVE: Distributional and Retrieval-Augmented Bidding with Value Evaluation

arXiv cs.LG ↗ · 2026-06-15 Cached

This paper introduces DRIVE, a unified Transformer-based framework for offline auto-bidding that decouples candidate action generation from decision making, combining distributional action modeling, retrieval-augmented candidate generation, and value-based evaluation to improve bidding performance under budget and cost constraints.

0 favorites 0 likes

#retrieval-augmented

Retrieve, Don't Retrain: Extending Vision Language Action Models to New Tasks at Test Time

Hugging Face Daily Papers ↗ · 2026-06-14 Cached

This paper introduces a retrieval-augmented vision-language-action policy that eliminates per-task fine-tuning by using pre-trained models with indexed demonstrations, enabling efficient cross-embodiment generalization and task adaptation at test time.

0 favorites 0 likes

#retrieval-augmented

Sharing for inspiration: Grep for agentic search was a game changer for us.

Reddit r/AI_Agents ↗ · 2026-06-12

Describes improving agentic memory search by incorporating grep-based exact matching alongside vector embeddings, inspired by a paper; achieved significant recall gains in their memory layer.

0 favorites 0 likes

#retrieval-augmented

PersonaDrive: Human-Style Retrieval-Augmented VLA Agents for Closed-Loop Driving Simulation

arXiv cs.AI ↗ · 2026-06-12 Cached

This paper introduces PersonaDrive, a pipeline that conditions a vision-language-action (VLA) driving agent on retrieved demonstrations from a style-instructed human driving dataset, enabling style-diverse non-ego agents for closed-loop simulation and improving driving scores on Bench2Drive.

0 favorites 0 likes

#retrieval-augmented

Less Context, More Accuracy: A Bi-Temporal Memory Engine for LLM Agents Where a Lean Retrieved Context Beats the Full History

arXiv cs.CL ↗ · 2026-06-10 Cached

This paper introduces Engram, an open-source bi-temporal memory engine for LLM agents that retrieves a compact context slice (∼9.6k tokens) to outperform the full-history baseline (79k tokens) by 10.4 accuracy points on LongMemEval, using a hybrid read path fusing dense, lexical, graph, and temporal signals.

0 favorites 0 likes

#retrieval-augmented

I asked how you all handle agent memory. Here's the pattern in the replies, and the one thing nobody's actually solved.

Reddit r/AI_Agents ↗ · 2026-06-09

A community discussion on agent memory reveals that while various patches exist for what to write down (e.g., plain files, layered memory, post-mortems), the unsolved problem is what to keep—detecting failures is tractable, but deciding which lessons persist still needs human judgment.

0 favorites 0 likes

#retrieval-augmented

A Four-Condition Diagnostic Protocol for Evidence Utilization in Long-Context and Retrieval-Augmented Language Models

arXiv cs.CL ↗ · 2026-06-08 Cached

This paper introduces a four-condition diagnostic protocol to separate no-evidence answerability, oracle-evidence recoverability, full-context utilization, and retrieval-conditioned utilization in long-context and retrieval-augmented language models, tested on five open-weight models across multiple datasets.

0 favorites 0 likes

#retrieval-augmented

QueryAgent-R1: Bridging Query Generation and Product Retrieval for E-Commerce Query Recommendation

arXiv cs.CL ↗ · 2026-06-05 Cached

QueryAgent-R1 is an agentic framework that bridges query generation and product retrieval in e-commerce using reinforcement learning and memory abstraction, improving query CTR by 2.9% and CVR by 3.1% in online tests.

0 favorites 0 likes

retrieval-augmented

Submit Feedback