@AdinaYakup: BitCPM4-CANN Native 1.58-bit LLM training system on Ascend NPUs https://huggingface.co/collections/openbmb/bitcpm4-cann…

X AI KOLs Following 05/22/26, 03:14 PM Models

ternary-quantization llm ascend-npu open-source model-release bitcpm4

Summary

OpenBMB releases BitCPM4-CANN, a collection of natively trained 1.58-bit ternary quantized LLMs (0.5B to 8B) optimized for Ascend NPUs via CANN, achieving 6× memory reduction at inference and minimal training overhead.

BitCPM4-CANN Native 1.58-bit LLM training system on Ascend NPUs https://huggingface.co/collections/openbmb/bitcpm4-cann… 0.5B/1B/3B/8B - Apache 2.0 6× less memory at inference Only 4.5% training throughput overhead

Original Article

View Cached Full Text

Cached at: 05/23/26, 12:05 PM

BitCPM4-CANN - a openbmb Collection

Source: https://huggingface.co/collections/openbmb/bitcpm4-cann updatedabout 23 hours ago

Full-pipeline ternary quantized model trained on CANN.

Similar Articles

@rohanpaul_ai: BitCPM-CANN just became the world’s first open-sourced 1.58-bit ternary LLM trained entirely on Chinese-developed AI in…

X AI KOLs Following

BitCPM-CANN is the first open-source 1.58-bit ternary LLM trained entirely on Chinese-developed AI infrastructure (Huawei Ascend 910B), offering extreme memory reduction for edge deployment.

NEW BITNET MODELS!

Reddit r/LocalLLaMA

New BitCPM4-CANN models (1B, 3B, 8B) from OpenBMB released on Hugging Face; awaiting llamacpp support for testing.

OpenBMB presents the model BitCPM-CANN 1.58 bit

Reddit r/LocalLLaMA

OpenBMB introduced BitCPM-CANN, a 1.58-bit model being tested on Huawei Ascend 910B hardware.

@AdinaYakup: MiniCPM V4.6 a 1B MLLM that actually runs on your phone, just released by @OpenBMB 1B - Apache2.0 Runs on iOS, Android,…

X AI KOLs Following

OpenBMB has released MiniCPM V4.6, a 1B-parameter multimodal large language model optimized for mobile devices under the Apache 2.0 license. It features mixed visual token compression and claims approximately 1.5x faster throughput than Qwen3.5 0.8B while running natively on iOS, Android, and HarmonyOS.

@FeitengLi: OpenBMB open-sources MiniCPM-V 4.6, 1.3B parameters (SigLIP2-400M + Qwen3.5-0.8B), 262k context, visual encoding FLOPs 50%+ less than previous generation. Token cost for the same task is lower than Qwen3.5-0…

X AI KOLs Timeline

OpenBMB releases MiniCPM-V 4.6, a 1.3B-parameter multimodal LLM with 262k context and significantly reduced visual encoding FLOPs, achieving strong benchmark performance and broad inference framework support.