sentiment-classification

#sentiment-classification

Applied Explainability for Large Language Models: A Comparative Study

arXiv cs.CL ↗ · 2026-04-20 Cached

A comparative study evaluating three explainability techniques (Integrated Gradients, Attention Rollout, SHAP) on fine-tuned DistilBERT for sentiment classification, highlighting trade-offs between gradient-based, attention-based, and model-agnostic approaches for LLM interpretability.

0 favorites 0 likes

sentiment-classification

Applied Explainability for Large Language Models: A Comparative Study

Submit Feedback