@AdinaYakup: Ovis2.6-80B-A3B > new MoE multimodal LLM from Alibaba's AIDC team 80B/3B active Apache2.0 64K context / 2880×2880 image…
Summary
Alibaba's AIDC team has released Ovis2.6-80B-A3B, an Apache 2.0 licensed Mixture of Experts multimodal LLM featuring 80B total parameters with 3B active, 64K context length, and native support for 2880×2880 images with Chain-of-Thought visual reasoning.
View Cached Full Text
Cached at: 05/12/26, 12:53 PM
Ovis2.6-80B-A3B > new MoE multimodal LLM from Alibaba’s AIDC team
✨ 80B/3B active ✨ Apache2.0 ✨ 64K context / 2880×2880 image resolution ✨ “Think with Image” : active visual reasoning in CoT https://t.co/08FpVf0aDd
Similar Articles
AIDC-AI/Ovis2.6-80B-A3B · Hugging Face
Ovis2.6-80B-A3B is a new Multimodal Large Language Model released by AIDC-AI, featuring a Mixture-of-Experts architecture with 80B total parameters but only 3B active during inference. It offers enhanced long-context processing, high-resolution understanding, and active visual reasoning capabilities.
@AdinaYakup: Step-3.7-Flash New VL model from @StepFun_ai 198B / 11B active - MoE 256K context 3 reasoning level Up to 400 tokens/sec
StepFun releases Step-3.7-Flash, a new large vision-language MoE model with 198B parameters (11B active), 256K context, and up to 400 tokens/sec inference speed.
@FeitengLi: OpenBMB open-sources MiniCPM-V 4.6, 1.3B parameters (SigLIP2-400M + Qwen3.5-0.8B), 262k context, visual encoding FLOPs 50%+ less than previous generation. Token cost for the same task is lower than Qwen3.5-0…
OpenBMB releases MiniCPM-V 4.6, a 1.3B-parameter multimodal LLM with 262k context and significantly reduced visual encoding FLOPs, achieving strong benchmark performance and broad inference framework support.
Liquid AI reveals 8B-A1B MoE trained on 38T
Liquid AI released LFM2.5-8B-A1B, an edge MoE model trained on 38T tokens with a 128K context window, improved tool calling, and reasoning capabilities, available on Hugging Face.
@AdinaYakup: MOSS-VL Vision model from @Open_MOSS Model: https://huggingface.co/collections/OpenMOSS-Team/moss-vl… Demo: https://hug…
Open_MOSS released MOSS-VL, an 11B Apache 2.0 vision-language model using cross-attention and XRoPE that outperforms Qwen3-VL-8B by 8.3 points on VSI-bench.