#prototype-reassignment

ConMoE: Expert-Pool Consolidation via Prototype Reassignment for MoE Compression

arXiv cs.AI · 2026-05-29

ConMoE proposes a training-free prototype-remapping framework for Mixture-of-Experts (MoE) compression. It selects a subset of the original experts as reusable prototypes and deterministically remaps every call to an original expert onto one of those prototypes, so only the prototypes need to be kept in memory. This reduces memory usage without weight updates or fine-tuning.
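The remapping idea can be illustrated with a toy sketch. This is not the paper's implementation: the summary above does not say how prototypes are chosen or how experts are assigned to them, so the prototype ids are simply given here and the nearest-weights assignment rule is an assumption made for illustration. The names `build_remap`, `consolidate`, and `moe_forward` are hypothetical, and each "expert" is reduced to a single weight vector applied as a dot product.

```python
from typing import Dict, List, Sequence

Vector = List[float]


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two weight vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def build_remap(expert_weights: List[Vector], prototype_ids: List[int]) -> List[int]:
    """Return a table mapping each original expert id to a prototype id.

    ASSUMPTION: an expert is assigned to the prototype with the closest
    weights. The source only says the remapping is deterministic.
    """
    remap = []
    for weights in expert_weights:
        nearest = min(
            prototype_ids,
            key=lambda p: squared_distance(weights, expert_weights[p]),
        )
        remap.append(nearest)
    return remap


def consolidate(expert_weights: List[Vector], prototype_ids: List[int]) -> Dict[int, Vector]:
    """Keep only the prototype experts; their weights are left unchanged."""
    return {p: expert_weights[p] for p in prototype_ids}


def moe_forward(
    x: Vector,
    routed_ids: List[int],
    gate_weights: List[float],
    remap: List[int],
    pool: Dict[int, Vector],
) -> float:
    """Toy MoE layer: the router still emits original expert ids,
    and each id is redirected to its prototype through the remap table."""
    out = 0.0
    for expert_id, gate in zip(routed_ids, gate_weights):
        prototype = pool[remap[expert_id]]
        out += gate * sum(w * xi for w, xi in zip(prototype, x))
    return out


# Four toy experts; experts 0 and 2 are (hypothetically) chosen as prototypes.
expert_weights = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
prototype_ids = [0, 2]

remap = build_remap(expert_weights, prototype_ids)
pool = consolidate(expert_weights, prototype_ids)

# The router selects original experts 1 and 3; both calls are served by prototypes.
result = moe_forward([2.0, 3.0], [1, 3], [0.5, 0.5], remap, pool)
```

In this sketch the router and the prototype weights are untouched; the only new state is the integer remap table, which is why no training step is involved. The memory saving comes from storing two weight vectors instead of four.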
