hallucination-mitigation

#hallucination-mitigation

When the Database Fails: Prompting LLM Dialogue Agents for Safe Recovery in Task-Oriented Dialogue

arXiv cs.CL · 4d ago

This paper studies a lightweight prompting-based recovery approach for LLM dialogue agents when backend database calls fail, showing that the Guided-Retry strategy reduces hallucination by 50% on MultiWOZ and 42% on SGD across six model families.

Dismantling Pathological Shortcuts: A Causal Framework for Faithful LVLM Decoding

arXiv cs.AI · 6d ago

This paper reveals that hallucination in large vision-language models is caused by a dynamic structural misalignment where certain attention heads act as risky mediators, decoupling from visual evidence to lock onto language priors. The authors propose Fox, a training-free causal intervention framework that diagnoses and physically severs these pathological shortcuts, achieving state-of-the-art performance in faithful decoding.

CaVe-VLM-CoT: An Interpretable Vision-Language Model Framework

arXiv cs.AI · 2026-06-18

CaVe-VLM-CoT is a modular reflection-based agentic-RAG framework for vision-language models that enforces evidence-grounded reasoning through a five-stage pipeline, achieving 87.1% accuracy on ScienceQA and proposing a suite of 23 metrics for evaluation.

MODE-RAG: Manifold Outlier Diagnosis and Energy-based Retrieval-Augmented Generation Evaluation

arXiv cs.CL · 2026-06-17

Introduces MODE-RAG, a multi-agent system using Variational Free Energy and Monte Carlo Tree Search to dynamically gate interventions for mitigating hallucinations in Multimodal Retrieval-Augmented Generation systems, along with the ModeVent evaluation dataset.

Trust but Verify: Mitigating Medical Hallucinations via Post-Hoc Adversarial Auditing and Multi-Agent Feedback Loops

arXiv cs.LG · 2026-06-15

This paper proposes a multi-agent 'Trust but Verify' system to reduce medical hallucinations in LLMs. It tests three open-access models on clinical questions about banned drugs and achieves a 53% reduction in hallucination error rate.

Can we stop dunking on DiffusionGemma and hack it instead?

Reddit r/LocalLLaMA · 2026-06-14

Discusses various methods to optimize DiffusionGemma inference, reduce hallucination, and improve performance for tool use and agents, including entropy-bounded sampling, schema scaffolding, and retrieval during denoising.

NTS-CoT: Mitigating Hallucinations in LLM-based News Timeline Summarization with Chain-of-Thought Reasoning

arXiv cs.CL · 2026-06-12

This paper proposes NTS-CoT, a novel framework that uses Chain-of-Thought reasoning to mitigate hallucinations in LLM-based news timeline summarization. It introduces three modules—Element-CoT, Date Selection, and Causal-CoT—to improve faithfulness and reduce omissions, outperforming state-of-the-art baselines on three benchmarks.

Mitigating Manifold Departure: Uncertainty-Aware Subspace Rectification for Trustworthy MLLM Decoding

arXiv cs.LG · 2026-06-10

This paper introduces MGAP, a training-free decoding method that reduces hallucinations in Multimodal Large Language Models by adaptively suppressing only the harmful parts of language priors while preserving the model's semantic manifold. The method outperforms prior baselines on POPE and CHAIR benchmarks.

Whisper Hallucination Detection and Mitigation via Hidden Representation Steering and Sparse AutoEncoders

Hugging Face Daily Papers · 2026-06-05

This paper demonstrates that Whisper's hallucination failures on silence, noise, or music can be detected and mitigated purely from internal activations using sparse autoencoders, achieving large reductions in hallucination rate without fine-tuning.

TIGER: Traceable Inference with Graph-Based Evidence Routing for Mitigating Hallucinations in Multimodal Generation

arXiv cs.AI · 2026-06-02

TIGER is an inference-time framework that mitigates hallucinations in multimodal generation by extracting observation and claim graphs and assigning risk scores to repair unsupported facts. It reduces unsupported content across image-to-text, image+text-to-text, audio-to-text, and video-to-text tasks.

MeasHalu: Mitigation of Scientific Measurement Hallucinations for Large Language Models with Enhanced Reasoning

arXiv cs.CL · 2026-04-21

MeasHalu is a novel framework for mitigating scientific measurement hallucinations in LLMs through a two-stage reasoning-aware fine-tuning strategy and a progressive reward curriculum. It introduces a fine-grained taxonomy of measurement-specific hallucinations and demonstrates improved accuracy on the MeasEval benchmark.

Wisdom is Knowing What not to Say: Hallucination-Free LLMs Unlearning via Attention Shifting

arXiv cs.CL · 2026-04-20

This paper introduces Attention-Shifting (AS), a novel framework for selective machine unlearning in LLMs that removes sensitive information while preventing hallucinations and preserving model utility. The method uses importance-aware attention suppression and retention enhancement to achieve up to 15% higher accuracy preservation than existing unlearning approaches on standard benchmarks.

FineSteer: A Unified Framework for Fine-Grained Inference-Time Steering in Large Language Models

arXiv cs.CL · 2026-04-20

FineSteer is a novel inference-time steering framework that decomposes steering into two stages, conditional steering and fine-grained vector synthesis, using Subspace-guided Conditional Steering (SCS) and Mixture-of-Steering-Experts (MoSE) mechanisms to improve safety and truthfulness while preserving model utility. Experiments show a 7.6% improvement over state-of-the-art methods on TruthfulQA with minimal utility loss.

Mitigating Multimodal Hallucination via Phase-wise Self-reward

Hugging Face Daily Papers · 2026-04-20

The PSRD framework halves multimodal hallucination in LVLMs by using phase-wise self-reward decoding and a distilled lightweight reward model, without extra supervision.
