Tag
XiaomiMiMo releases MiMo-V2.5-Pro-FP4-DFlash, an FP4-quantized MoE model with block-diffusion speculative decoding to reduce memory and bandwidth for trillion-parameter inference.
Xiaomi open-sourced MiMo-V2.5-Pro, a 1.02 trillion parameter MoE model, prompting a cost-benefit analysis of using its API versus self-hosting for autonomous coding tasks.