@AdinaYakup: BitCPM4-CANN Native 1.58-bit LLM training system on Ascend NPUs https://huggingface.co/collections/openbmb/bitcpm4-cann…
Summary
OpenBMB releases BitCPM4-CANN, a collection of natively trained 1.58-bit ternary-quantized LLMs (0.5B to 8B) optimized for Ascend NPUs via CANN, claiming 6× less memory at inference and only 4.5% training throughput overhead.
Cached at: 05/23/26, 12:05 PM
BitCPM4-CANN Native 1.58-bit LLM training system on Ascend NPUs https://huggingface.co/collections/openbmb/bitcpm4-cann…
- 0.5B/1B/3B/8B - Apache 2.0
- 6× less memory at inference
- Only 4.5% training throughput overhead
BitCPM4-CANN - an openbmb Collection
Source: https://huggingface.co/collections/openbmb/bitcpm4-cann (updated about 23 hours ago)
Full-pipeline ternary quantized model trained on CANN.
#### openbmb/BitCPM4-CANN-0.5B-gguf
Text Generation • 0.4B • Updated 1 day ago • 185 • 2
#### openbmb/BitCPM4-CANN-1B-gguf
Text Generation • 2B • Updated 1 day ago • 174 • 1
#### openbmb/BitCPM4-CANN-3B-gguf
Text Generation • 4B • Updated 1 day ago • 162 • 2
#### openbmb/BitCPM4-CANN-8B-gguf
Text Generation • 8B • Updated 1 day ago • 256 • 7
#### openbmb/BitCPM4-CANN-0.5B
Text Generation • Updated 1 day ago • 176 • 5
#### openbmb/BitCPM4-CANN-1B
Text Generation • Updated 1 day ago • 58 • 5
#### openbmb/BitCPM4-CANN-3B
Text Generation • Updated 1 day ago • 82 • 6
#### openbmb/BitCPM4-CANN-8B
Text Generation • Updated 1 day ago • 204 • 12
#### openbmb/BitCPM4-CANN-0.5B-unquantized
Text Generation • Updated about 21 hours ago • 50
#### openbmb/BitCPM4-CANN-1B-unquantized
Text Generation • Updated about 21 hours ago • 71
#### openbmb/BitCPM4-CANN-3B-unquantized
Text Generation • Updated about 21 hours ago • 64
#### openbmb/BitCPM4-CANN-8B-unquantized
Text Generation • Updated about 21 hours ago • 72
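For readers unfamiliar with the "1.58-bit" label: a ternary weight takes one of three values (-1, 0, +1), which carries log2(3) ≈ 1.585 bits of information. The sketch below illustrates the idea in Python using a per-tensor absmean scale, as popularized by BitNet b1.58. That scaling rule is an assumption made for illustration only; the collection page does not describe the exact recipe BitCPM4-CANN uses, and the helper names `ternary_quantize` and `dequantize` are hypothetical.

```python
import math

def ternary_quantize(weights):
    """Map floats to codes in {-1, 0, +1} plus one shared scale.

    Assumption: absmean scaling (mean of |w|), chosen for illustration;
    not confirmed as the scheme used by BitCPM4-CANN.
    """
    scale = sum(abs(w) for w in weights) / len(weights) or 1.0
    # Round to the nearest integer, then clip into the ternary range.
    codes = [max(-1, min(1, round(w / scale))) for w in weights]
    return codes, scale

def dequantize(codes, scale):
    """Reconstruct approximate float weights from ternary codes."""
    return [c * scale for c in codes]

# Information content of one three-valued weight.
BITS_PER_TERNARY = math.log2(3)

weights = [0.9, -0.05, -1.2, 0.4, 0.0, -0.7]
codes, scale = ternary_quantize(weights)
approx = dequantize(codes, scale)

print(codes)                       # ternary codes
print(round(BITS_PER_TERNARY, 3))  # bits per ternary weight
```

This is a toy, weight-only illustration. It does not attempt to reproduce the 6× inference-memory figure quoted in the post, which is a reported measurement for the released models.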
Similar Articles
@rohanpaul_ai: BitCPM-CANN just became the world’s first open-sourced 1.58-bit ternary LLM trained entirely on Chinese-developed AI in…
BitCPM-CANN is the first open-source 1.58-bit ternary LLM trained entirely on Chinese-developed AI infrastructure (Huawei Ascend 910B), offering extreme memory reduction for edge deployment.
NEW BITNET MODELS!
New BitCPM4-CANN models (1B, 3B, 8B) from OpenBMB released on Hugging Face; awaiting llama.cpp support for testing.
OpenBMB presents the model BitCPM-CANN 1.58 bit
OpenBMB introduced BitCPM-CANN, a 1.58-bit model being tested on Huawei Ascend 910B hardware.
@AdinaYakup: MiniCPM V4.6 a 1B MLLM that actually runs on your phone, just released by @OpenBMB 1B - Apache2.0 Runs on iOS, Android,…
OpenBMB has released MiniCPM V4.6, a 1B-parameter multimodal large language model optimized for mobile devices under the Apache 2.0 license. It features mixed visual token compression and claims approximately 1.5x faster throughput than Qwen3.5 0.8B while running natively on iOS, Android, and HarmonyOS.
@FeitengLi: OpenBMB open-sources MiniCPM-V 4.6, 1.3B parameters (SigLIP2-400M + Qwen3.5-0.8B), 262k context, visual encoding FLOPs 50%+ less than previous generation. Token cost for the same task is lower than Qwen3.5-0…
OpenBMB releases MiniCPM-V 4.6, a 1.3B-parameter multimodal LLM with 262k context and significantly reduced visual encoding FLOPs, achieving strong benchmark performance and broad inference framework support.