web-retrieval

#web-retrieval

Relevance as a Vulnerability: How Web Retrieval Degrades Safety Alignment in LLM Agents

arXiv cs.CL · 2026-05-29

This paper investigates how adding web retrieval to LLM agents can degrade safety alignment. It reveals the 'Safe Source Paradox', in which even safety-oriented documents increase harmful compliance, and introduces the AgentREVEAL diagnostic framework and the HarmURLBench benchmark to analyze and evaluate retrieval-induced safety vulnerabilities.


#web-retrieval

I made a small tool to inspect retrieval results before feeding them into RAG

Reddit r/LocalLLaMA · 2026-05-27

A developer built a small local tool that inspects retrieval results from search providers such as Brave, Serper, Tavily, and Exa before they are fed into a RAG pipeline. It checks signals such as source diversity, duplicates, freshness, and SEO/GEO pollution risk.


