shortcut-detection

Tag

Cards List
#shortcut-detection

Calibration vs Decision Making: Revisiting the Reliability Paradox in Unlearned Language Models

arXiv cs.CL · 2026-05-21 Cached

This paper revisits the reliability paradox in the context of machine unlearning for language models, demonstrating that models can achieve low calibration error while relying on shortcut-based decision rules, thereby extending the paradox to unlearned models.

0 favorites 0 likes
← Back to home

Submit Feedback