faithfulness

Tag

Cards List
#faithfulness

Architecture-Aware Explanation Auditing for Industrial Visual Inspection

arXiv cs.LG · 2026-05-15 Cached

This paper introduces an architecture-aware explanation audit protocol for industrial visual inspection, demonstrating that the faithfulness of explanation methods is bounded by their structural compatibility with a model's native decision mechanism, using experiments on wafer map and anomaly detection datasets.

0 favorites 0 likes
#faithfulness

RaguTeam at SemEval-2026 Task 8: Meno and Friends in a Judge-Orchestrated LLM Ensemble for Faithful Multi-Turn Response Generation

Hugging Face Daily Papers · 2026-05-06 Cached

This paper presents the winning system for SemEval-2026 Task 8's generation subtask, using a heterogeneous ensemble of seven LLMs with dual prompting strategies and a GPT-4o-mini judge to select the best response. The system achieved first place with a conditioned harmonic mean of 0.7827, outperforming all baselines and demonstrating the value of model diversity.

0 favorites 0 likes
#faithfulness

Faithfulness-Aware Uncertainty Quantification for Fact-Checking the Output of Retrieval Augmented Generation

arXiv cs.CL · 2026-04-20 Cached

This paper introduces FRANQ, a method for detecting hallucinations in Retrieval-Augmented Generation (RAG) systems by applying distinct uncertainty quantification techniques to distinguish between factuality and faithfulness to retrieved context. The authors construct a new dataset annotated for both factuality and faithfulness, and demonstrate that FRANQ outperforms existing approaches in detecting factual errors across multiple datasets and LLMs.

0 favorites 0 likes
#faithfulness

AtManRL: Towards Faithful Reasoning via Differentiable Attention Saliency

arXiv cs.CL · 2026-04-20 Cached

AtManRL is a method that uses differentiable attention manipulation and reinforcement learning to train LLMs to generate more faithful chain-of-thought reasoning by ensuring reasoning tokens causally influence final predictions. Experiments on GSM8K and MMLU with Llama-3.2-3B demonstrate the approach can identify influential reasoning tokens and improve reasoning transparency.

0 favorites 0 likes
← Previous
← Back to home

Submit Feedback