conflicting-evidence

Tag

Cards List
#conflicting-evidence

ConflictScore: Identifying and Measuring How Language Models Handle Conflicting Evidence

arXiv cs.CL · 3d ago Cached

ConflictScore is a new metric that quantifies how well language models acknowledge conflicting evidence in their grounding documents, decomposing responses into atomic claims and measuring conflict balance. The paper also introduces ConflictBench, a benchmark covering diverse conflict forms, and shows the metric can improve truthfulness on TruthfulQA.

0 favorites 0 likes
← Back to home

Submit Feedback