@heyshrutimishra: OH MY GOD CHINA JUST MATCHED USA FRONTIER CODING AI AT 40-60% LOWER TOKEN COST. XIAOMI JUST DROPPED MiMo-V2.5-Pro score…
Summary
Xiaomi released MiMo-V2.5-Pro, a coding model that scores 73.7 on SWE-Bench Pro, close to Claude Opus 4.6's 77.1, at a claimed 40-60% lower token cost than US frontier models.
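To put the summary's numbers in perspective, here is a minimal Python sketch of the arithmetic. The two scores and the 40-60% range come from the post. The $100 baseline spend and the helper names `score_gap` and `discounted_cost` are hypothetical, chosen only for illustration.

```python
# Figures quoted in the post.
MIMO_SCORE = 73.7             # MiMo-V2.5-Pro on SWE-Bench Pro
OPUS_SCORE = 77.1             # Claude Opus 4.6 on SWE-Bench Pro
SAVINGS_RANGE = (0.40, 0.60)  # claimed token-cost reduction

# Hypothetical baseline spend, for illustration only (not from the source).
BASELINE_COST_USD = 100.0


def score_gap(candidate: float, reference: float) -> tuple[float, float]:
    """Return (absolute gap in points, gap as a fraction of the reference score)."""
    gap = reference - candidate
    return gap, gap / reference


def discounted_cost(baseline: float, savings: float) -> float:
    """Cost after applying a fractional savings to a baseline cost."""
    return baseline * (1.0 - savings)


gap_points, gap_fraction = score_gap(MIMO_SCORE, OPUS_SCORE)
cost_high = discounted_cost(BASELINE_COST_USD, SAVINGS_RANGE[0])  # 40% savings
cost_low = discounted_cost(BASELINE_COST_USD, SAVINGS_RANGE[1])   # 60% savings

print(f"Score gap: {gap_points:.1f} points ({gap_fraction:.1%} of the reference score)")
print(f"Cost for a ${BASELINE_COST_USD:.0f} baseline workload: ${cost_low:.0f} to ${cost_high:.0f}")
```

Under these assumptions the gap is 3.4 points, or about 4.4% of the reference score, and a workload costing $100 at baseline would cost roughly $40 to $60.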
Cached at: 04/22/26, 09:12 PM
OH MY GOD CHINA JUST MATCHED USA FRONTIER CODING AI AT 40-60% LOWER TOKEN COST. XIAOMI JUST DROPPED MiMo-V2.5-Pro. It scores 73.7 on SWE-Bench Pro (Claude Opus 4.6 is at 77.1). It’s solving problems that would take human experts WEEKS and it built a complete compiler from
Similar Articles
Xiaomi's new open source, agentic AI coding harness MiMo Code beats Claude Code at ultra-long, 200+ step tasks (14 minute read)
Xiaomi open-sourced MiMo Code, an AI coding assistant with a novel memory architecture that outperforms Claude Code on long-horizon tasks, and includes free access to its MiMo-V2.5 model.
Tested Xiaomi's MiMo V2.5 Pro for autonomous coding: 301 commits, 60+ pages, $70 in API costs. Now it's open-source.
Xiaomi has open-sourced its MiMo V2.5 Pro model, a 1.02T parameter MoE model designed for autonomous coding tasks. The article details a real-world test showing high efficiency with low API costs due to high cache hit rates.
China's Xiaomi MiMo Is Now 15X Faster Than ChatGPT and Claude (4 minute read)
Xiaomi achieved over 1,000 tokens per second inference on its trillion-parameter MiMo-V2.5-Pro-UltraSpeed model using commodity 8-GPU nodes via FP4 quantization and DFlash speculative decoding, outpacing GPT-5.5 and Claude Opus by over 10x.
Two open-source models from China just blew Claude Opus 4.6 out of the water (Kimi 2.6 and Xiaomi MiMo V2.5 Pro)
Chinese teams open-sourced Kimi 2.6 and Xiaomi MiMo V2.5 Pro, which reportedly surpass Claude Opus 4.6 on benchmarks.
Xiaomi just claimed 1,000+ tps on a 1T model using a standard 8-GPU server
Xiaomi released MiMo-V2.5-Pro-UltraSpeed in collaboration with TileRT, achieving over 1000 tokens/s decode speed on a 1-trillion-parameter model, enabling real-time AI interaction and accelerating coding agents and reasoning tasks.