attribution-graph

Tag

Cards List
#attribution-graph

Toxic HallucinAItions: Perturbing Prompts and Tracing LLM Circuits

arXiv cs.CL · 3d ago Cached

This paper investigates how toxic lexical perturbations in prompts reduce the factual accuracy and increase uncertainty of LLMs, and uses attribution-graph analyses to trace internal changes. It finds that increasing toxicity amplifies perturbation-sensitive variant nodes while core reasoning nodes remain invariant.

0 favorites 0 likes
#attribution-graph

Why Retrieval-Augmented Generation Fails: A Graph Perspective

arXiv cs.CL · 2026-05-15 Cached

This paper investigates why Retrieval-Augmented Generation (RAG) systems fail despite having access to correct evidence. Using circuit tracing and attribution graphs, the authors find that correct predictions exhibit deeper reasoning paths and more distributed evidence flow, while failures show shallow and fragmented patterns. They propose a graph-based error detection framework and targeted interventions to improve RAG reliability.

0 favorites 0 likes
← Back to home

Submit Feedback