distribution-correction

Tag

Cards List
#distribution-correction

Distribution Corrected Offline Data Distillation for Large Language Models

arXiv cs.CL · 2026-05-15 Cached

This paper proposes a principled offline reasoning distillation framework that corrects teacher-student distribution drift, improving reasoning accuracy on math benchmarks without requiring online rollouts.

0 favorites 0 likes
← Back to home

Submit Feedback