Huawei open-sources OpenPangu-2.0-Flash - 92B total,6B active
Summary
Huawei open-sources OpenPangu-2.0-Flash, a 92B-parameter MoE model with 6B active parameters and 512K context, along with inference code and training operations.
Similar Articles
Huawei Released openPangu 2.0 (Will open source on June 30)
Huawei announced openPangu 2.0, an open-source large model with 505B total parameters and a 28:1 sparsity ratio, optimized for Ascend computing and HarmonyOS, with key components to be open-sourced starting June 30.
ascend-tribe/openPangu-2.0-Flash (They haven't uploaded it to Huggingface yet)
openPangu-2.0-Flash is a 92B MoE model with 6B activated parameters, 512k context, trained on Ascend with 34T tokens, incorporating slow/fast thinking and multiple RL training stages.
@sheriyuo: The industry's first trillion-parameter model to complete end-to-end training and inference on a 50,000-GPU Chinese com…
Meituan released LongCat-2.0, a 1.6T-parameter MoE model with 1M context, claimed as the first to train on a 50,000-GPU Chinese cluster, now available on OpenRouter for agentic coding.
@witcheer: can’t believe gpt-oss-20b perfs on 8GB vRAM 21B total params, 3.6B active (MoE). OpenAI, Apache 2.0. uses only 1.8 GB V…
A new open-source MoE model, gpt-oss-20b (21B total, 3.6B active), runs on only 1.8GB VRAM and achieves perfect scores on agentic coding tasks, outperforming other local models like Gemma and Qwen.
@FeitengLi: OpenBMB open-sources MiniCPM-V 4.6, 1.3B parameters (SigLIP2-400M + Qwen3.5-0.8B), 262k context, visual encoding FLOPs 50%+ less than previous generation. Token cost for the same task is lower than Qwen3.5-0…
OpenBMB releases MiniCPM-V 4.6, a 1.3B-parameter multimodal LLM with 262k context and significantly reduced visual encoding FLOPs, achieving strong benchmark performance and broad inference framework support.