@rohanpaul_ai: BitCPM-CANN just became the world’s first open-sourced 1.58-bit ternary LLM trained entirely on Chinese-developed AI in…

X AI KOLs Following 05/22/26, 02:36 PM Models

open-source ternary-llm 1.58-bit edge-ai chinese-ai model-compression quantization

Summary

BitCPM-CANN is the first open-source 1.58-bit ternary LLM trained entirely on Chinese-developed AI infrastructure (Huawei Ascend 910B), offering extreme memory reduction for edge deployment.

BitCPM-CANN just became the world’s first open-sourced 1.58-bit ternary LLM trained entirely on Chinese-developed AI infrastructure. Developed by ModelBest, Tsinghua Univ, and OpenBMB community, the entire training pipeline, from quantization operators and algorithms to the full-stack framework, was natively executed on Huawei Ascend 910B NPUs. 1.58-bit ternary weights use only 3 weight states, so the model needs far less memory when deployed on phones, PCs, cars, and local industrial devices. The harder achievement is the training system behind it: QAT, STE, low-bit operators, algorithms, framework work, and reproducible training scripts all had to hold together on Ascend 910B. When hardware costs rise, the winning model is not merely the one that scores higher in a chart, but the one that can be trained, reproduced, deployed, and improved under real constraints.

Original Article

View Cached Full Text

Cached at: 05/24/26, 04:16 AM

BitCPM-CANN just became the world’s first open-sourced 1.58-bit ternary LLM trained entirely on Chinese-developed AI infrastructure.

Developed by ModelBest, Tsinghua Univ, and OpenBMB community, the entire training pipeline, from quantization operators and algorithms to the full-stack framework, was natively executed on Huawei Ascend 910B NPUs.

1.58-bit ternary weights use only 3 weight states, so the model needs far less memory when deployed on phones, PCs, cars, and local industrial devices.

The harder achievement is the training system behind it: QAT, STE, low-bit operators, algorithms, framework work, and reproducible training scripts all had to hold together on Ascend 910B.

When hardware costs rise, the winning model is not merely the one that scores higher in a chart, but the one that can be trained, reproduced, deployed, and improved under real constraints.

OpenBMB (@OpenBMB): 🚀 BitCPM-CANN by ModelBest × @Tsinghua_Uni × OpenBMB is here — and it’s not about stacking parameters. Memory costs are skyrocketing. Hardware constraints are tightening. Edge AI needs smarter solutions — and BitCPM-CANN delivers！🎉

✅ Edge-ready: 8B model runs smoothly on

@rohanpaul_ai: BitCPM-CANN just became the world’s first open-sourced 1.58-bit ternary LLM trained entirely on Chinese-developed AI in…

Similar Articles

@AdinaYakup: BitCPM4-CANN Native 1.58-bit LLM training system on Ascend NPUs https://huggingface.co/collections/openbmb/bitcpm4-cann…

@heyshrutimishra: Full-sized AI models now run on phones. That's BitCPM, a new open-source model from ModelBest, Tsinghua, and OpenBMB. T…

OpenBMB presents the model BitCPM-CANN 1.58 bit

Ternary Bonsai: Top Intelligence at 1.58 Bits

@AdinaYakup: MiniCPM5-1B is an impressive release in the 1B class! @OpenBMB https://huggingface.co/collections/openbmb/minicpm5… 1B …

Submit Feedback

Similar Articles

@AdinaYakup: BitCPM4-CANN Native 1.58-bit LLM training system on Ascend NPUs https://huggingface.co/collections/openbmb/bitcpm4-cann…

@heyshrutimishra: Full-sized AI models now run on phones. That's BitCPM, a new open-source model from ModelBest, Tsinghua, and OpenBMB. T…

OpenBMB presents the model BitCPM-CANN 1.58 bit
OpenBMB introduced BitCPM-CANN, a 1.58-bit model being tested on Huawei Ascend 910B hardware.

Ternary Bonsai: Top Intelligence at 1.58 Bits

@AdinaYakup: MiniCPM5-1B is an impressive release in the 1B class! @OpenBMB https://huggingface.co/collections/openbmb/minicpm5… 1B …