semantic-norm-drift

Tag

Cards List
#semantic-norm-drift

The Misattribution Gap: When Memory Poisoning Looks Like Model Failure in Agentic AI Systems

arXiv cs.AI · 2026-05-25 Cached

This paper identifies a structural failure in multi-agent AI pipelines where memory-layer attacks can be misattributed as model misalignment, formalizing Semantic Norm Drift (SND) and proposing Counterfactual Composition Testing and Memory-Persistent Information-Flow Control as defenses.

0 favorites 0 likes
← Back to home

Submit Feedback