soft-label


Consistently Informative Soft-Label Temperature for Knowledge Distillation

arXiv cs.LG · 2026-05-21

Proposes CIST, a method that assigns separate sample-wise adaptive temperatures to teacher and student in knowledge distillation, producing consistently informative soft labels and relaxing rigid logit-scale matching. Experiments on vision and language tasks show consistent improvements over standard KD.
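The summary does not give CIST's actual temperature rule, so the following is only a minimal sketch of the general idea: a distillation loss in which teacher and student each receive their own per-sample temperature. The function names (`sample_temperature`, `kd_loss`) and the `base_t` / `base_s` parameters are hypothetical, and the temperature rule used here (per-sample logit standard deviation) is a stand-in chosen for illustration, not the paper's method.

```python
import math
from statistics import pstdev


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    lse = m + math.log(sum(math.exp(z - m) for z in logits))
    return [z - lse for z in logits]


def sample_temperature(logits, base=1.0, eps=1e-8):
    # ASSUMPTION: placeholder rule, NOT the CIST rule. The temperature is the
    # population standard deviation of this sample's logits, times `base`.
    return base * (pstdev(logits) + eps)


def kd_loss(teacher_batch, student_batch, base_t=1.0, base_s=1.0):
    """Mean KL(teacher || student), each side softened by its own
    per-sample temperature instead of one shared global temperature."""
    total = 0.0
    for t_logits, s_logits in zip(teacher_batch, student_batch):
        tau_t = sample_temperature(t_logits, base_t)  # teacher temperature
        tau_s = sample_temperature(s_logits, base_s)  # student temperature
        log_p = log_softmax([z / tau_t for z in t_logits])
        log_q = log_softmax([z / tau_s for z in s_logits])
        total += sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))
    return total / len(teacher_batch)


teacher = [[2.0, 1.0, 0.1], [0.5, 2.5, -1.0]]
scaled_student = [[5.0 * z for z in row] for row in teacher]
print(kd_loss(teacher, scaled_student))
```

Under this placeholder rule, a student whose logits are a scaled copy of the teacher's incurs a near-zero loss, because each side is normalized by its own temperature. That is one way to read "relaxing rigid logit-scale matching", though the paper's own formulation may differ.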
