core-set-selection

MADS: Model-Aware Diverse Core Set Selection for Instruction Tuning

arXiv cs.CL · 2026-06-01

This paper proposes MADS, a method that leverages neural activation states from LLMs to select diverse core sets for instruction tuning, showing that a 15% subset can outperform full-dataset fine-tuning on multiple benchmarks.
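
To make the idea of selecting a diverse core set from model-derived features concrete, here is a minimal sketch. It is not the MADS algorithm, whose details are not given in this summary. It assumes each instruction example has already been mapped to a fixed-length feature vector (for instance, some summary of LLM activations), and it uses greedy k-center (farthest-point) selection as a generic stand-in for a diversity criterion. The function name `select_core_set`, the `fraction` parameter, and the random toy features are all hypothetical.

```python
import math
import random


def select_core_set(features, fraction=0.15, seed=0):
    """Pick a diverse subset of row indices via greedy k-center selection.

    Assumption: `features` is a list of equal-length numeric vectors, one per
    training example. This is a generic diversity heuristic, not MADS itself.
    """
    n = len(features)
    k = max(1, round(fraction * n))

    # Start from a random example so the result does not depend on data order.
    rng = random.Random(seed)
    first = rng.randrange(n)
    selected = [first]

    # min_dist[i] = distance from example i to its nearest selected example.
    min_dist = [math.dist(f, features[first]) for f in features]

    while len(selected) < k:
        # Take the example that is farthest from everything chosen so far.
        nxt = max(range(n), key=min_dist.__getitem__)
        selected.append(nxt)
        for i, f in enumerate(features):
            d = math.dist(f, features[nxt])
            if d < min_dist[i]:
                min_dist[i] = d

    return selected


# Toy stand-in for activation features: 200 examples, 8 dimensions each.
toy_rng = random.Random(1)
toy_features = [[toy_rng.gauss(0, 1) for _ in range(8)] for _ in range(200)]
core = select_core_set(toy_features, fraction=0.15)
print(len(core), "of", len(toy_features), "examples selected")
```

The 15% default mirrors the subset size quoted in the summary. In a real pipeline the toy vectors would be replaced by features extracted from the model being tuned, and the selected indices would define the fine-tuning subset.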
