@ctnzr: We've gone even farther: Nemotron 3 Super is 120B and pretrained on 25T tokens in NVFP4. Nemotron 3 Ultra is ~500B and …
Summary
NVIDIA announces Nemotron 3 Super (120B), pretrained on 25T tokens in NVFP4, and Nemotron 3 Ultra (~500B), also pretrained in NVFP4. The announcement frames both as examples of accelerated computing: rethinking every layer of the AI stack to improve efficiency.
View Cached Full Text
Cached at: 05/15/26, 11:08 PM
We’ve gone even farther: Nemotron 3 Super is 120B and pretrained on 25T tokens in NVFP4. Nemotron 3 Ultra is ~500B and also pretrained in NVFP4.
Accelerated computing means we rethink every aspect of the AI stack looking for new opportunities to improve efficiency.
Similar Articles
NVIDIA Nemotron 3 Ultra is out.
NVIDIA has released Nemotron 3 Ultra, a new model designed to power faster and more efficient reasoning for long-running AI agents.
Nemotron 3 Ultra by NVIDIA
NVIDIA introduces Nemotron 3 Ultra, a new AI model designed to enable faster and more efficient reasoning for long-running agents.
Nemotron 3 Ultra. 550 billion parameters, 55B active. 1 million context
NVIDIA releases Nemotron 3 Ultra, a 550-billion-parameter mixture-of-experts model with 55B active parameters and a 1-million-token context window.
NVIDIA just announced the release of Nemotron 3 Ultra (2 minute read)
Anthropic released Claude Opus 4.5, its most intelligent model, scoring 70 on the Artificial Analysis Intelligence Index and ranking second only to Gemini 3 Pro. It achieves significant gains in coding and agentic tasks while reducing per-token pricing and maintaining strong safety performance.
@mervenoyann: NVIDIA Nemotron Ultra is here > 55B/550B a hybrid MoE  with 1M context window > supports MTP speculative decoding > da…
NVIDIA released Nemotron Ultra, a hybrid MoE model with 55B active out of 550B total parameters and a 1M-token context window. It supports MTP speculative decoding and is available day-0 in transformers.