causal-caution

Tag

Cards List
#causal-caution

When Helpfulness Overrides Causal Caution: Context-Dependent Suppression and Recovery in LLMs

arXiv cs.AI · 2026-06-24 Cached

This paper investigates how the tension between helpfulness and safety in LLMs leads to context-dependent suppression and recovery of certain behaviors, showing that the drive to be helpful can override causal caution mechanisms.

0 favorites 0 likes
← Back to home

Submit Feedback