Tag
This paper investigates intrinsic data metrics to predict the utility of reasoning supervision before costly fine-tuning, finding that smaller models benefit from alignment-focused metrics while larger models gain from verbose traces, thus establishing a scale-aware framework for validating reasoning datasets.