GEM: Geometric Entropy Mixing for Optimal LLM Data Curation

arXiv cs.LG · 2026-05-27

GEM reformulates LLM data curation as a variational problem on the hypersphere. It uses geometric entropy mixing and a minorize-maximize algorithm to discover balanced semantic clusters, and it reports state-of-the-art results among data mixing strategies, with gains of up to 1.2% in average downstream accuracy.
