@AdinaYakup: BitCPM4-CANN Native 1.58-bit LLM training system on Ascend NPUs https://huggingface.co/collections/openbmb/bitcpm4-cann…

X AI KOLs Following Models

Summary

OpenBMB releases BitCPM4-CANN, a collection of natively trained 1.58-bit ternary quantized LLMs (0.5B to 8B) optimized for Ascend NPUs via CANN, achieving 6× memory reduction at inference and minimal training overhead.

BitCPM4-CANN Native 1.58-bit LLM training system on Ascend NPUs https://huggingface.co/collections/openbmb/bitcpm4-cann… 0.5B/1B/3B/8B - Apache 2.0 6× less memory at inference Only 4.5% training throughput overhead
Original Article
View Cached Full Text

Cached at: 05/23/26, 12:05 PM

BitCPM4-CANN Native 1.58-bit LLM training system on Ascend NPUs https://huggingface.co/collections/openbmb/bitcpm4-cann… 0.5B/1B/3B/8B - Apache 2.0 6× less memory at inference Only 4.5% training throughput overhead


BitCPM4-CANN - a openbmb Collection

Source: https://huggingface.co/collections/openbmb/bitcpm4-cann updatedabout 23 hours ago

Full-pipeline ternary quantized model trained on CANN.

Similar Articles

NEW BITNET MODELS!

Reddit r/LocalLLaMA

New BitCPM4-CANN models (1B, 3B, 8B) from OpenBMB released on Hugging Face; awaiting llamacpp support for testing.