causal-caution

#causal-caution

When Helpfulness Overrides Causal Caution: Context-Dependent Suppression and Recovery in LLMs

arXiv cs.AI ↗ · 2026-06-24 Cached

This paper investigates how the tension between helpfulness and safety in LLMs leads to context-dependent suppression and recovery of certain behaviors, showing that the drive to be helpful can override causal caution mechanisms.

0 favorites 0 likes

causal-caution

When Helpfulness Overrides Causal Caution: Context-Dependent Suppression and Recovery in LLMs

Submit Feedback