Tag
A user asks about the feasibility of running GLM-5.2 at 4-bit quantization on four Ascend GX10s or DGX Sparks, wondering about speed and memory for 100k context.
Huawei announced openPangu 2.0, an open-source large model with 505B total parameters and a 28:1 sparsity ratio, optimized for Ascend computing and HarmonyOS, with key components to be open-sourced starting June 30.
Huawei has open-sourced its CANN software toolkit to compete with Nvidia's CUDA, and DeepSeek V4 shows significant inference performance improvements on Huawei Ascend chips.