ternary-quantization

#ternary-quantization

@AdinaYakup: BitCPM4-CANN Native 1.58-bit LLM training system on Ascend NPUs https://huggingface.co/collections/openbmb/bitcpm4-cann…

X AI KOLs Following ↗ · 2026-05-22 Cached

OpenBMB releases BitCPM4-CANN, a collection of natively trained 1.58-bit ternary quantized LLMs (0.5B to 8B) optimized for Ascend NPUs via CANN, achieving 6× memory reduction at inference and minimal training overhead.

0 favorites 0 likes

#ternary-quantization

Tequila: Trapping-free Ternary Quantization for Large Language Models

Papers with Code Trending ↗ · 2025-09-28 Cached

This paper introduces Tequila, a trapping-free quantization method for Large Language Models that improves ternary quantization accuracy and inference speed by repurposing deadzone-trapped weights as dynamic biases.

0 favorites 0 likes

ternary-quantization

@AdinaYakup: BitCPM4-CANN Native 1.58-bit LLM training system on Ascend NPUs https://huggingface.co/collections/openbmb/bitcpm4-cann…

Tequila: Trapping-free Ternary Quantization for Large Language Models

Submit Feedback