Huawei open-sources OpenPangu-2.0-Flash - 92B total, 6B active

Reddit r/LocalLLaMA 06/30/26, 11:58 AM Models

huawei open-source pangu-model moe llm ai-model

Summary

Huawei has open-sourced OpenPangu-2.0-Flash, a Mixture-of-Experts (MoE) model with 92B total parameters, 6B active parameters, and a 512K context window. The release covers the weights, inference code, and training ops.

https://x.com/Chinazhidx/status/2071877413685109071

TODAY: #Huawei open-sources OpenPangu-2.0-Flash

#OpenPangu 2.0 includes two 512K-context models:
• Flash: 92B total, 6B active; weights, inference code, and training ops released
• Pro: 505B total, 18B active; flagship model, coming in July

More open-source components later this year

https://preview.redd.it/29tji3noteah1.png?width=1446&format=png&auto=webp&s=836b711cc97c5efb3d37126105a11a7d20c49ca2

https://x.com/CalatheaAI/status/2071917592810496273
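As a rough back-of-the-envelope check, the total and active parameter counts quoted above can be turned into a total-to-active ratio and an active fraction for each model. This is only a sketch: the figures are the rounded ones from the announcement, and the `sparsity` helper and `MODELS` table are illustrative names, not part of any Huawei release.

```python
# Parameter counts in billions, as quoted in the announcement (rounded figures).
MODELS = {
    "Flash": {"total_b": 92, "active_b": 6},
    "Pro": {"total_b": 505, "active_b": 18},
}


def sparsity(total_b, active_b):
    """Return (total-to-active ratio, active share of parameters in percent)."""
    return total_b / active_b, 100 * active_b / total_b


for name, p in MODELS.items():
    ratio, pct = sparsity(p["total_b"], p["active_b"])
    print(f"{name}: {ratio:.1f}:1 total-to-active, {pct:.1f}% of parameters active")
```

With these inputs, Flash comes out at roughly 15:1 and Pro at roughly 28:1, so the Pro model is the sparser of the two by this measure.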

Similar Articles

Huawei Released openPangu 2.0 (Will open source on June 30)

Reddit r/LocalLLaMA

Huawei announced openPangu 2.0, an open-source large model with 505B total parameters and a 28:1 sparsity ratio, optimized for Ascend computing and HarmonyOS, with key components to be open-sourced starting June 30.

ascend-tribe/openPangu-2.0-Flash (They haven't uploaded it to Hugging Face yet)

Reddit r/LocalLLaMA

openPangu-2.0-Flash is a 92B MoE model with 6B activated parameters, 512k context, trained on Ascend with 34T tokens, incorporating slow/fast thinking and multiple RL training stages.

@sheriyuo: The industry's first trillion-parameter model to complete end-to-end training and inference on a 50,000-GPU Chinese com…

X AI KOLs Timeline

Meituan released LongCat-2.0, a 1.6T-parameter MoE model with 1M context, claimed as the first to train on a 50,000-GPU Chinese cluster, now available on OpenRouter for agentic coding.

@witcheer: can’t believe gpt-oss-20b perfs on 8GB vRAM 21B total params, 3.6B active (MoE). OpenAI, Apache 2.0. uses only 1.8 GB V…

X AI KOLs Timeline

A new open-source MoE model, gpt-oss-20b (21B total, 3.6B active), runs on only 1.8GB VRAM and achieves perfect scores on agentic coding tasks, outperforming other local models like Gemma and Qwen.

@FeitengLi: OpenBMB open-sources MiniCPM-V 4.6, 1.3B parameters (SigLIP2-400M + Qwen3.5-0.8B), 262k context, visual encoding FLOPs 50%+ less than previous generation. Token cost for the same task is lower than Qwen3.5-0…

X AI KOLs Timeline

OpenBMB releases MiniCPM-V 4.6, a 1.3B-parameter multimodal LLM with 262k context and significantly reduced visual encoding FLOPs, achieving strong benchmark performance and broad inference framework support.
