Why there is a lack of new 100B-120B models?

Reddit r/LocalLLaMA News

Summary

Analysis of the trend in AI model sizes, noting a gap in the 100-120B parameter range with recent releases focusing on smaller (25-35B) or larger (200B+) models.

GPT-OSS-120B was the first model of that family, which was followed by GLM-4.5-Air, Nemotron-3-Super, Qwen3.5-122B, Mistral-Small-4-119B. However, all models are at least 3 months old (10 months for GPT-OSS-120B) and all latest releases are either 25B-35B (Gemma4, Qwen3.6) or 200B+ (Step 3.5/3.7 Flash, DeepSeek-V4-Flash, MiniMax-M3, Nemotron-3-Ultra). Did the \~120B MoE family "die" like the 70B/80B one or there will likely be new releases for H2 2026?
Original Article

Similar Articles