inference-misalignment

#inference-misalignment

Understanding Why Language Models Hallucinate: Testing Reasoning Against Priors

arXiv cs.CL ↗ · 3d ago Cached

This paper studies why language models hallucinate, proposing that hallucinations often stem from biased latent inference (inference misalignment) rather than missing knowledge. It introduces TrapQA, a controlled diagnostic testbed to test reasoning against priors, and demonstrates that hallucinations can arise from misleading latent associations.

0 favorites 0 likes

inference-misalignment

Understanding Why Language Models Hallucinate: Testing Reasoning Against Priors

Submit Feedback