retrieval-augmented

#retrieval-augmented

Sharing for inspiration: Grep for agentic search was a game changer for us.

Reddit r/AI_Agents ↗ · yesterday

Describes improving agentic memory search by incorporating grep-based exact matching alongside vector embeddings, inspired by a paper; achieved significant recall gains in their memory layer.

0 favorites 0 likes

#retrieval-augmented

PersonaDrive: Human-Style Retrieval-Augmented VLA Agents for Closed-Loop Driving Simulation

arXiv cs.AI ↗ · yesterday Cached

This paper introduces PersonaDrive, a pipeline that conditions a vision-language-action (VLA) driving agent on retrieved demonstrations from a style-instructed human driving dataset, enabling style-diverse non-ego agents for closed-loop simulation and improving driving scores on Bench2Drive.

0 favorites 0 likes

#retrieval-augmented

Less Context, More Accuracy: A Bi-Temporal Memory Engine for LLM Agents Where a Lean Retrieved Context Beats the Full History

arXiv cs.CL ↗ · 3d ago Cached

This paper introduces Engram, an open-source bi-temporal memory engine for LLM agents that retrieves a compact context slice (∼9.6k tokens) to outperform the full-history baseline (79k tokens) by 10.4 accuracy points on LongMemEval, using a hybrid read path fusing dense, lexical, graph, and temporal signals.

0 favorites 0 likes

#retrieval-augmented

I asked how you all handle agent memory. Here's the pattern in the replies, and the one thing nobody's actually solved.

Reddit r/AI_Agents ↗ · 4d ago

A community discussion on agent memory reveals that while various patches exist for what to write down (e.g., plain files, layered memory, post-mortems), the unsolved problem is what to keep—detecting failures is tractable, but deciding which lessons persist still needs human judgment.

0 favorites 0 likes

#retrieval-augmented

A Four-Condition Diagnostic Protocol for Evidence Utilization in Long-Context and Retrieval-Augmented Language Models

arXiv cs.CL ↗ · 5d ago Cached

This paper introduces a four-condition diagnostic protocol to separate no-evidence answerability, oracle-evidence recoverability, full-context utilization, and retrieval-conditioned utilization in long-context and retrieval-augmented language models, tested on five open-weight models across multiple datasets.

0 favorites 0 likes

#retrieval-augmented

QueryAgent-R1: Bridging Query Generation and Product Retrieval for E-Commerce Query Recommendation

arXiv cs.CL ↗ · 2026-06-05 Cached

QueryAgent-R1 is an agentic framework that bridges query generation and product retrieval in e-commerce using reinforcement learning and memory abstraction, improving query CTR by 2.9% and CVR by 3.1% in online tests.

0 favorites 0 likes

#retrieval-augmented

Online Skill Learning for Web Agents via State-Grounded Dynamic Retrieval

arXiv cs.AI ↗ · 2026-06-04 Cached

This paper proposes SGDR (State-Grounded Dynamic Retrieval), an online skill learning method for web agents that enables stepwise, state-aware skill reuse rather than static task-level retrieval. Experiments on WebArena show SGDR achieves 37.5% success rate with GPT-4.1, a ~10.6% relative gain over strong baselines.

0 favorites 0 likes

#retrieval-augmented

Show HN: Mnemo – local-first AI memory layer for any LLM (Rust, SQLite,petgraph)

Hacker News Top ↗ · 2026-06-03 Cached

Mnemo is an open-source, local-first memory layer for any LLM that extracts entities and relationships into a persistent knowledge graph using SQLite and petgraph, providing automatic context injection for enhanced conversations.

0 favorites 0 likes

#retrieval-augmented

@SilvioMartinico: The late-interaction multivector retrieval ecosystem is exploding right now. To help separate the signal from the noise…

X AI KOLs Timeline ↗ · 2026-06-02 Cached

A curated list of top models, engines, libraries, and datasets for late-interaction multivector retrieval, organized in an 'Awesome Multivector Retrieval' resource.

0 favorites 0 likes

#retrieval-augmented

SENSE: Semantic Embedding Navigation with Soft-gated Evaluation for Retrieval-based Speculative Decoding

arXiv cs.CL ↗ · 2026-06-02 Cached

Proposes SENSE, a semantic embedding navigation method for retrieval-based speculative decoding that uses hidden states for semantic alignment and soft-gated evaluation, achieving up to 3.26x speedup on LLaMA and Qwen families while preserving generation quality.

0 favorites 0 likes

#retrieval-augmented

ExpGraph: Model-Agnostic Experience Learning with Graph-Structured Memory for LLM Agents

arXiv cs.CL ↗ · 2026-06-01 Cached

ExpGraph is a model-agnostic framework that enables LLM agents to reuse past experiences via a self-evolving graph of skills and failures, improving task performance by 12–21% without retraining the executor.

0 favorites 0 likes

#retrieval-augmented

LLMs Without Deep Neural Networks: New Architecture, Benefits and Case Study

arXiv cs.LG ↗ · 2026-06-01 Cached

This paper presents an alternative architecture for LLMs using Radial Basis Function (RBF) networks that eliminates deep neural networks and finds the global optimum in closed form, requiring no iterative training. It also reviews other non-DNN methods like KANs and k-NN retrieval, with a case study demonstrating increased explainability and faster training.

0 favorites 0 likes

#retrieval-augmented

Context Distillation as Latent Memory Management

arXiv cs.LG ↗ · 2026-05-29 Cached

This paper formulates context distillation as a latent memory management problem, proposing a framework that stores distilled contexts as independent LoRA adapters with retrieval, routing, and self-gating to improve robustness and efficiency.

0 favorites 0 likes

#retrieval-augmented

Micro-Macro Retrieval: Reducing Long-Form Hallucination in Large Language Models

arXiv cs.CL ↗ · 2026-05-29 Cached

This paper introduces Micro-Macro Retrieval (M2R), a retrieve-while-generate framework that reduces hallucination in long-form LLM outputs by ensuring key information stays close to generated text. It uses curriculum learning-based reinforcement learning to train retrieval and grounding skills, showing effectiveness especially in lengthy contexts.

0 favorites 0 likes

#retrieval-augmented

RAG4Outcome: A Retrieval-Augmented Multimodal Framework for Prognostic Prediction in Chronic Osteomyelitis

arXiv cs.AI ↗ · 2026-05-25 Cached

Proposes RAG4Outcome, a retrieval-augmented generation framework integrating multimodal clinical data (PET-CT reports, surgical records, follow-up notes) to improve prognostic prediction in chronic osteomyelitis, enhancing interpretability and clinical reliability.

0 favorites 0 likes

#retrieval-augmented

AI memory systems are becoming harder to trust the longer you use them

Reddit r/AI_Agents ↗ · 2026-05-22

AI memory systems often recall outdated or incorrect information over time, highlighting the challenge of maintaining trust in long-term memory for AI agents.

0 favorites 0 likes

#retrieval-augmented

Claim-Selective Certification for High-Risk Medical Retrieval-Augmented Generation

arXiv cs.CL ↗ · 2026-05-22 Cached

This paper proposes claim-selective certification for high-risk medical retrieval-augmented generation (RAG), decomposing responses into verifiable claims and scoring them against evidence to produce actions (full, partial, conflict, abstain) using an intent-aware selector, achieving low unsupported-claim risk and high action accuracy.

0 favorites 0 likes

#retrieval-augmented

When Cases Get Rare: A Retrieval Benchmark for Off-Guideline Clinical Question Answering

arXiv cs.CL ↗ · 2026-05-22 Cached

Introduces OGCaReBench, a free-form retrieval benchmark for evaluating LLMs on clinical questions that require reasoning beyond standard guidelines. Experiments show that even the best model achieves only 56% accuracy, but retrieval augmentation boosts performance to 82%.

0 favorites 0 likes

#retrieval-augmented

Retrieval-Augmented Long-Context Translation for Cultural Image Captioning: Gators submission for AmericasNLP 2026 shared task

arXiv cs.CL ↗ · 2026-05-21 Cached

University of Florida Gators submission to the AmericasNLP 2026 shared task on cultural image captioning for Indigenous languages, using a two-stage pipeline with Qwen2.5-VL for Spanish captioning and retrieval-augmented Gemini 2.5 Flash for target-language translation, achieving significant improvements over the baseline.

0 favorites 0 likes

#retrieval-augmented

More Context, Larger Models, or Moral Knowledge? A Systematic Study of Schwartz Value Detection in Political Texts

Hugging Face Daily Papers ↗ · 2026-05-21 Cached

A systematic study on detecting Schwartz values in political text, comparing context lengths, model sizes, and retrieval-augmented generation methods. Results show that full-document context improves supervised models but not zero-shot LLMs, while retrieved moral knowledge consistently helps via early fusion.

0 favorites 0 likes

retrieval-augmented

Submit Feedback