@rohanpaul_ai: BitCPM-CANN just became the world’s first open-sourced 1.58-bit ternary LLM trained entirely on Chinese-developed AI in…
Summary
BitCPM-CANN is the first open-source 1.58-bit ternary LLM trained entirely on Chinese-developed AI infrastructure (Huawei Ascend 910B), offering extreme memory reduction for edge deployment.
View Cached Full Text
Cached at: 05/24/26, 04:16 AM
BitCPM-CANN just became the world’s first open-sourced 1.58-bit ternary LLM trained entirely on Chinese-developed AI infrastructure.
Developed by ModelBest, Tsinghua Univ, and OpenBMB community, the entire training pipeline, from quantization operators and algorithms to the full-stack framework, was natively executed on Huawei Ascend 910B NPUs.
1.58-bit ternary weights use only 3 weight states, so the model needs far less memory when deployed on phones, PCs, cars, and local industrial devices.
The harder achievement is the training system behind it: QAT, STE, low-bit operators, algorithms, framework work, and reproducible training scripts all had to hold together on Ascend 910B.
When hardware costs rise, the winning model is not merely the one that scores higher in a chart, but the one that can be trained, reproduced, deployed, and improved under real constraints.
OpenBMB (@OpenBMB): 🚀 BitCPM-CANN by ModelBest × @Tsinghua_Uni × OpenBMB is here — and it’s not about stacking parameters. Memory costs are skyrocketing. Hardware constraints are tightening. Edge AI needs smarter solutions — and BitCPM-CANN delivers!🎉
✅ Edge-ready: 8B model runs smoothly on
Similar Articles
@AdinaYakup: BitCPM4-CANN Native 1.58-bit LLM training system on Ascend NPUs https://huggingface.co/collections/openbmb/bitcpm4-cann…
OpenBMB releases BitCPM4-CANN, a collection of natively trained 1.58-bit ternary quantized LLMs (0.5B to 8B) optimized for Ascend NPUs via CANN, achieving 6× memory reduction at inference and minimal training overhead.
@heyshrutimishra: Full-sized AI models now run on phones. That's BitCPM, a new open-source model from ModelBest, Tsinghua, and OpenBMB. T…
BitCPM is a new open-source model from ModelBest, Tsinghua, and OpenBMB that uses ternary weights (-1,0,1) to run full-sized AI models on phones.
OpenBMB presents the model BitCPM-CANN 1.58 bit
OpenBMB introduced BitCPM-CANN, a 1.58-bit model being tested on Huawei Ascend 910B hardware.
Ternary Bonsai: Top Intelligence at 1.58 Bits
A highly efficient AI model architecture using ternary weights (-1, 0, 1) that achieves competitive performance while requiring only 1.58 bits per parameter, enabling deployment on extremely constrained devices.
@AdinaYakup: MiniCPM5-1B is an impressive release in the 1B class! @OpenBMB https://huggingface.co/collections/openbmb/minicpm5… 1B …
MiniCPM5-1B is a new 1B parameter AI model from OpenBMB featuring hybrid reasoning with Think/No-Think modes, 128K context, and Apache 2.0 license, running on various hardware.