causal-intervention

#causal-intervention

Dismantling Pathological Shortcuts: A Causal Framework for Faithful LVLM Decoding

arXiv cs.AI ↗ · 2d ago Cached

This paper reveals that hallucination in large vision-language models is caused by a dynamic structural misalignment where certain attention heads act as risky mediators, decoupling from visual evidence to lock onto language priors. The authors propose Fox, a training-free causal intervention framework that diagnoses and physically severs these pathological shortcuts, achieving state-of-the-art performance in faithful decoding.

0 favorites 0 likes

#causal-intervention

The Weight Norm Sets the Grokking Timescale: A Causal Delay Law

arXiv cs.LG ↗ · 2026-06-15 Cached

This paper demonstrates that the weight norm causally controls the timescale of grokking in neural networks, reconciling conflicting accounts. Through interventions, it shows that grokking follows an exponential delay law and that norm magnitude dominates grokking time over learning rate across architectures.

0 favorites 0 likes

#causal-intervention

Causal Evidence for Attention Head Imbalance in Modality Conflict Hallucination

arXiv cs.AI ↗ · 2026-05-20 Cached

This paper identifies imbalanced attention head groups in MLLMs that drive or resist modality-conflict hallucination, and proposes MACI, a causal intervention that suppresses hallucination-driving heads only when conflict is detected, achieving large hallucination reduction across five models.

0 favorites 0 likes

causal-intervention

Dismantling Pathological Shortcuts: A Causal Framework for Faithful LVLM Decoding

The Weight Norm Sets the Grokking Timescale: A Causal Delay Law

Causal Evidence for Attention Head Imbalance in Modality Conflict Hallucination

Submit Feedback