lrp

Tag

Cards List
#lrp

Contrastive Attribution in the Wild: An Interpretability Analysis of LLM Failures on Realistic Benchmarks

Hugging Face Daily Papers · 2026-04-20 Cached

Researchers apply contrastive LRP-based attribution to analyze why LLMs fail on realistic benchmarks, finding the method gives useful signals in some cases but is not universally reliable.

0 favorites 0 likes
← Back to home

Submit Feedback