@ctnzr: We've gone even farther: Nemotron 3 Super is 120B and pretrained on 25T tokens in NVFP4. Nemotron 3 Ultra is ~500B and …

X AI KOLs Following 05/15/26, 09:01 PM Models

nvidia nemotron llm pretraining nvfp4 efficiency

Summary

NVIDIA announces Nemotron 3 Super (120B) and Nemotron 3 Ultra (~500B) models, pretrained on 25T tokens using NVFP4 precision, emphasizing accelerated computing and efficiency improvements.

We've gone even farther: Nemotron 3 Super is 120B and pretrained on 25T tokens in NVFP4. Nemotron 3 Ultra is ~500B and also pretrained in NVFP4. Accelerated computing means we rethink every aspect of the AI stack looking for new opportunities to improve efficiency.

Original Article

View Cached Full Text

Cached at: 05/15/26, 11:08 PM

We’ve gone even farther: Nemotron 3 Super is 120B and pretrained on 25T tokens in NVFP4. Nemotron 3 Ultra is ~500B and also pretrained in NVFP4.

Accelerated computing means we rethink every aspect of the AI stack looking for new opportunities to improve efficiency.

Similar Articles

NVIDIA Nemotron 3 Ultra is out.

Reddit r/LocalLLaMA

NVIDIA has released Nemotron 3 Ultra, a new model designed to power faster and more efficient reasoning for long-running AI agents.

Nemotron 3 Ultra by NVIDIA

Product Hunt

NVIDIA introduces Nemotron 3 Ultra, a new AI model designed to enable faster and more efficient reasoning for long-running agents.

Nemotron 3 Ultra. 550 billion parameters, 55B active. 1 million context

Reddit r/LocalLLaMA

NVIDIA releases Nemotron 3 Ultra, a massive 550 billion parameter mixture-of-experts model with 55B active parameters and a 1 million token context window.

NVIDIA just announced the release of Nemotron 3 Ultra (2 minute read)

TLDR AI

Anthropic released Claude Opus 4.5, its most intelligent model, scoring 70 on the Artificial Analysis Intelligence Index and ranking second only to Gemini 3 Pro. It achieves significant gains in coding and agentic tasks while reducing per-token pricing and maintaining strong safety performance.

@mervenoyann: NVIDIA Nemotron Ultra is here > 55B/550B a hybrid MoE with 1M context window > supports MTP speculative decoding > da…

X AI KOLs Following

NVIDIA released Nemotron Ultra, a hybrid MoE model with 55B/550B parameters and a 1M context window, supporting MTP speculative decoding and available day-0 in transformers.