prototype-reassignment

Tag

Cards List
#prototype-reassignment

ConMoE: Expert-Pool Consolidation via Prototype Reassignment for MoE Compression

arXiv cs.AI · 2026-05-29 Cached

ConMoE proposes a train-free prototype remapping framework for Mixture-of-Experts (MoE) compression, which selects a subset of experts as reusable prototypes and deterministically remaps original expert calls to them, reducing memory usage without weight updates or fine-tuning.

0 favorites 0 likes
← Back to home

Submit Feedback