Qwen3.7 Preview lands on Arena (1 minute read)

TLDR AI Models

Summary

Alibaba Qwen announces two major model releases: Qwen3-Omni, the first natively end-to-end omni-modal AI unifying text, image, audio and video, and Qwen3-Next-80B-A3B, an ultra-efficient MoE model with 3B activated parameters per token, achieving SOTA performance and 10x faster inference than Qwen3-32B.

Qwen3.7 Preview is now on Arena for Text and Vision. Qwen3.7 Max Preview ranks 13th overall in Text Arena, while Qwen3.7 Plus Preview ranks 16th overall in Vision Arena.
Original Article
View Cached Full Text

Cached at: 05/20/26, 12:21 AM

# Thread by @Alibaba_Qwen on Thread Reader App Source: [https://threadreaderapp.com/thread/2056403591464984753.html](https://threadreaderapp.com/thread/2056403591464984753.html) ## More from @Alibaba\_Qwen [![Qwen Profile picture](https://pbs.twimg.com/profile_images/1894073235379273728/0ROUmdkE_bigger.jpg)](https://threadreaderapp.com/user/Alibaba_Qwen) Sep 22, 2025 🚀 Introducing Qwen3\-Omni — the first natively end\-to\-end omni\-modal AI unifying text, image, audio & video in one model — no modality trade\-offs\! 🏆 SOTA on 22/36 audio & AV benchmarks 🌍 119L text / 19L speech in / 10L speech out ⚡ 211ms latency \| 🎧 30\-min audio understanding 🎨 Fully customizable via system prompts 🔗 Built\-in tool calling 🎤 Open\-source Captioner model \(low\-hallucination\!\) 🌟 What’s Open\-Sourced? We’ve open\-sourced Qwen3\-Omni\-30B\-A3B\-Instruct, Qwen3\-Omni\-30B\-A3B\-Thinking, and Qwen3\-Omni\-30B\-A3B\-Captioner, to empower developers to explore a variety of applications from instruction\-following to creative tasks\. Try it now 👇 💬 Qwen Chat:[chat\.qwen\.ai/?models=qwen3\-…](https://chat.qwen.ai/?models=qwen3-omni-flash) 💻 GitHub:[github\.com/QwenLM/Qwen3\-O…](https://github.com/QwenLM/Qwen3-Omni) 🤗 HF Models:[huggingface\.co/collections/Qw…](https://huggingface.co/collections/Qwen/qwen3-omni-68d100a86cd0906843ceccbe) 🤖 MS Models: [modelscope\.cn/collections/Qw…](https://modelscope.cn/collections/Qwen3-Omni-867aef131e7d4f) 🎬 Demo:[huggingface\.co/spaces/Qwen/Qw…](https://huggingface.co/spaces/Qwen/Qwen3-Omni-Demo)[![Image](https://threadreaderapp.com/images/1px.png)](https://pbs.twimg.com/media/G1d9f1ZawAA0DRH.jpg) Use the voice chat and video chat features on Qwen Chat to experience the Qwen3\-Omni model\. Performance[![Image](https://threadreaderapp.com/images/1px.png)](https://pbs.twimg.com/media/G1eAq2KaoAI2beT.jpg) Read 4 tweets [![Qwen Profile picture](https://pbs.twimg.com/profile_images/1894073235379273728/0ROUmdkE_bigger.jpg)](https://threadreaderapp.com/user/Alibaba_Qwen) Sep 11, 2025 🚀 Introducing Qwen3\-Next\-80B\-A3B — the FUTURE of efficient LLMs is here\! 🔹 80B params, but only 3B activated per token → 10x cheaper training, 10x faster inference than Qwen3\-32B\.\(esp\. @ 32K\+ context\!\) 🔹Hybrid Architecture: Gated DeltaNet \+ Gated Attention → best of speed & recall 🔹 Ultra\-sparse MoE: 512 experts, 10 routed \+ 1 shared 🔹 Multi\-Token Prediction → turbo\-charged speculative decoding 🔹 Beats Qwen3\-32B in perf, rivals Qwen3\-235B in reasoning & long\-context 🧠 Qwen3\-Next\-80B\-A3B\-Instruct approaches our 235B flagship\. 🧠 Qwen3\-Next\-80B\-A3B\-Thinking outperforms Gemini\-2\.5\-Flash\-Thinking\. Try it now:[chat\.qwen\.ai](https://chat.qwen.ai/) Blog:[qwen\.ai/blog?id=4074cc…](https://qwen.ai/blog?id=4074cca80393150c248e508aa62983f9cb7d27cd&from=research.latest-advancements-list) Huggingface:[huggingface\.co/collections/Qw…](https://huggingface.co/collections/Qwen/qwen3-next-68c25fd6838e585db8eeea9d) ModelScope:[modelscope\.cn/collections/Qw…](https://modelscope.cn/collections/Qwen3-Next-c314f23bd0264a) Kaggle:[kaggle\.com/models/qwen\-lm…](https://www.kaggle.com/models/qwen-lm/qwen3-next-80b) Alibaba Cloud API:[alibabacloud\.com/help/en/model\-…](https://www.alibabacloud.com/help/en/model-studio/models#c5414da58bjgj)[![Image](https://threadreaderapp.com/images/1px.png)](https://pbs.twimg.com/media/G0lVrz1aMAMqnVP.jpg) Pretraining Efficiency & Inference Speed[![Image](https://threadreaderapp.com/images/1px.png)](https://pbs.twimg.com/media/G0ldAWZakAAGBdx.jpg) Prefill Stage: At 4K context length, throughput is nearly 7x higher than Qwen3\-32B\. Beyond 32K, it’s over 10x faster\.[![Image](https://threadreaderapp.com/images/1px.png)](https://pbs.twimg.com/media/G0ldN9sbAAAwSft.jpg) Read 9 tweets

Similar Articles

Qwen 3.7 Preview

Hacker News Top

Alibaba releases Qwen3.7-Max-Preview and Qwen3.7-Plus-Preview on Arena, achieving top rankings in Text and Vision categories.

Qwen 3.7 Max

Reddit r/LocalLLaMA

Qwen 3.7 is an impressive new AI model from Chinese labs, with discussion on whether weights will be available for download.

Qwen3.6-Max-Preview

Product Hunt

Alibaba releases Qwen3.6-Max-Preview, a flagship model optimized for agentic coding tasks.