memory-poisoning

#memory-poisoning

The Misattribution Gap: When Memory Poisoning Looks Like Model Failure in Agentic AI Systems

arXiv cs.AI ↗ · 2026-05-25 Cached

This paper identifies a structural failure in multi-agent AI pipelines where memory-layer attacks can be misattributed as model misalignment, formalizing Semantic Norm Drift (SND) and proposing Counterfactual Composition Testing and Memory-Persistent Information-Flow Control as defenses.

0 favorites 0 likes

#memory-poisoning

MemAudit: Post-hoc Auditing of Poisoned Agent Memory via Causal Attribution and Structural Anomaly Detection

arXiv cs.AI ↗ · 2026-05-25 Cached

MemAudit is a post-hoc auditing framework for memory-augmented LLM agents that identifies poisoned memories by combining counterfactual influence scores and structural anomaly detection, reducing attack success rates from over 70% to 0% in realistic scenarios.

0 favorites 0 likes

memory-poisoning

The Misattribution Gap: When Memory Poisoning Looks Like Model Failure in Agentic AI Systems

MemAudit: Post-hoc Auditing of Poisoned Agent Memory via Causal Attribution and Structural Anomaly Detection

Submit Feedback