Tag
This paper investigates the trade-off between plausibility and faithfulness in cross-lingual explanations from LLMs, finding that English-pivot explanations achieve higher span agreement with human rationales but suffer reduced causal faithfulness compared to native-language explanations.
The article explores illusions of understanding in scientific practice, discussing how ambiguous language, incomplete causal accounts, and explanations that satisfy without being complete can lead scientists to stop short of deeper understanding.