SemiPrune is a label-efficient dataset pruning framework that uses semi-supervised learning to generate pseudo-labels from a small labeled subset, enabling existing supervised pruning methods to work with unlabeled data. It achieves state-of-the-art performance on domain-specific, image-corrupted, and long-tailed datasets.
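The two-stage idea described above (pseudo-label the unlabeled pool from a small labeled subset, then pass those pseudo-labels to a pruning method that expects labels) can be illustrated with a toy sketch. This is not SemiPrune's actual implementation. The nearest-centroid rule is a simplified stand-in for a real semi-supervised learner, the distance-to-centroid score is a hypothetical stand-in for an existing supervised pruning method, and the names `pseudo_label`, `prune`, and `keep_fraction` are made up for illustration.

```python
import math
import random


def centroid(points):
    """Component-wise mean of a list of equal-length feature vectors."""
    dim = len(points[0])
    return [sum(p[d] for p in points) / len(points) for d in range(dim)]


def pseudo_label(labeled, unlabeled):
    """Give each unlabeled point the class of its nearest labeled-class centroid.

    Assumption: this simple rule stands in for a real semi-supervised learner.
    """
    by_class = {}
    for x, y in labeled:
        by_class.setdefault(y, []).append(x)
    centroids = {y: centroid(xs) for y, xs in by_class.items()}
    return [min(centroids, key=lambda y: math.dist(x, centroids[y])) for x in unlabeled]


def prune(xs, ys, keep_fraction):
    """Hypothetical label-dependent pruning score: distance to own class centroid.

    Keeps the samples farthest from their class centroid and returns their
    indices in ascending order.
    """
    by_class = {}
    for x, y in zip(xs, ys):
        by_class.setdefault(y, []).append(x)
    centroids = {y: centroid(pts) for y, pts in by_class.items()}
    scores = [math.dist(x, centroids[y]) for x, y in zip(xs, ys)]
    n_keep = max(1, round(keep_fraction * len(xs)))
    ranked = sorted(range(len(xs)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:n_keep])


# Toy data: two 2-D Gaussian clusters, 5 labeled and 50 unlabeled points each.
rng = random.Random(0)


def sample(center, n):
    return [[rng.gauss(center[0], 1.0), rng.gauss(center[1], 1.0)] for _ in range(n)]


labeled = [(x, 0) for x in sample((0.0, 0.0), 5)] + [(x, 1) for x in sample((5.0, 5.0), 5)]
unlabeled = sample((0.0, 0.0), 50) + sample((5.0, 5.0), 50)

pseudo = pseudo_label(labeled, unlabeled)            # stage 1: pseudo-labels
kept = prune(unlabeled, pseudo, keep_fraction=0.3)   # stage 2: label-dependent pruning
print(len(kept), "of", len(unlabeled), "unlabeled samples kept")
```

The point of the sketch is the interface: `prune` only ever sees labels, so any pruning score that depends on labels could be substituted without changing the pseudo-labeling stage.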