attribution-methods

Tag

Cards List
#attribution-methods

GRALIS: A Unified Canonical Framework for Linear Attribution Methods via Riesz Representation

arXiv cs.LG · 2026-05-08 Cached

This arXiv preprint introduces GRALIS, a unified mathematical framework using Riesz Representation Theory to formalize and compare linear attribution methods like SHAP, LIME, and Integrated Gradients.

0 favorites 0 likes
#attribution-methods

TPA: Next Token Probability Attribution for Detecting Hallucinations in RAG

arXiv cs.CL · 2026-04-20 Cached

TPA proposes a novel method for detecting hallucinations in RAG systems by attributing next-token probabilities to seven distinct sources (Query, RAG Context, Past Token, Self Token, FFN, Final LayerNorm, Initial Embedding) and aggregating by Part-of-Speech tags. The approach achieves state-of-the-art performance across five LLMs including Llama2, Llama3, Mistral, and Qwen.

0 favorites 0 likes
← Back to home

Submit Feedback