proxy-training

Tag

Cards List
#proxy-training

RegMix-D: Dynamic Data Mixing via Proxy Training Trajectories

arXiv cs.CL · 2d ago Cached

RegMix-D extends RegMix to dynamic data mixing by using loss trajectories from proxy runs to predict optimal mixtures at multiple training stages, achieving improvements over static methods.

0 favorites 0 likes
← Back to home

Submit Feedback