interpretable-ai

ERP-XTTN: Interpretable Prototype-Guided Cross-Attention for Cross-Subject ERP Classification

arXiv cs.LG · 5d ago

Introduces ERP-XTTN, a cross-attention architecture for interpretable cross-subject ERP classification without calibration. Evaluated on multiple datasets, it performs competitively with black-box models while providing transparent routing insights.


Surfacing Isolated Learners with Outcome-Independent Mediation of Feedback between Teachers and Students Using AI

arXiv cs.AI · 2026-05-29

This paper proposes an interpretable decision layer for AI-augmented classrooms that combines teacher and student feedback to rank course topics needing attention without using grades. The approach surfaces isolated learners and aligns with instructor concerns in a preliminary study.


Distinguishing Right from Wrong in Debates: Attribution Analysis of Chinese Harmful Memes

arXiv cs.CL · 2026-05-26

This paper introduces Ex-ToxiCN-MM, the first Chinese harmful meme explanation dataset, together with the knowledge base C-HarmKB and the attribution analysis framework RIKE, to improve interpretable detection of harmful memes by accounting for cultural context and ambiguity.


Interpretable Discriminative Text Representations via Agreement and Label Disentanglement

arXiv cs.CL · 2026-05-21

This paper proposes an operational criterion for interpretable text representations based on inter-annotator agreement and label disentanglement, and introduces LLM-assisted Feature Discovery (LFD), a method that uses cross-LLM agreement screening and residual predictive gain to select clear, label-disentangled features. Experiments show LFD matches predictive performance while producing more interpretable features, validated by human audits.


OceanCBM: A Concept Bottleneck Model for Mechanistic Interpretability in Ocean Forecasting

arXiv cs.LG · 2026-05-14

OceanCBM is a concept bottleneck model for spatiotemporal prediction and mechanistic interpretability in ocean forecasting, using mixed supervision to predict mixed layer heat content while imposing soft physical structure. The model achieves interpretable, physically grounded representations without sacrificing predictive skill.
