@ctnzr: We've gone even farther: Nemotron 3 Super is 120B and pretrained on 25T tokens in NVFP4. Nemotron 3 Ultra is ~500B and …

Summary

NVIDIA announces Nemotron 3 Super (120B), pretrained on 25T tokens in NVFP4 precision, and Nemotron 3 Ultra (~500B), also pretrained in NVFP4. The post frames both as results of rethinking every layer of the AI stack for efficiency under accelerated computing.

Cached at: 05/15/26, 11:08 PM

We’ve gone even farther: Nemotron 3 Super is 120B and pretrained on 25T tokens in NVFP4. Nemotron 3 Ultra is ~500B and also pretrained in NVFP4.

Accelerated computing means we rethink every aspect of the AI stack looking for new opportunities to improve efficiency.
