inference-misalignment


Understanding Why Language Models Hallucinate: Testing Reasoning Against Priors

arXiv cs.CL · 3d ago

This paper studies why language models hallucinate and proposes that hallucinations often stem from biased latent inference (inference misalignment) rather than from missing knowledge. It introduces TrapQA, a controlled diagnostic testbed for testing reasoning against priors, and demonstrates that misleading latent associations can give rise to hallucinations.
