nemotron

Tag

Cards List
#nemotron

NVIDIA has released Nemotron-TwoTower-30B-A3B-Base-BF16, an unusual diffusion-based language model built from the Nemotron 3 Nano 30B-A3B backbone.

Reddit r/LocalLLaMA · yesterday Cached

NVIDIA released Nemotron-TwoTower-30B-A3B-Base-BF16, a diffusion-based language model that uses block-wise autoregressive diffusion to generate text by iterative denoising of token blocks, achieving 2.42× the generation throughput of the autoregressive baseline while retaining 98.7% of benchmark quality.

0 favorites 0 likes
#nemotron

@TheAhmadOsman: Don't sleep on Nemotron 3 Ultra Sometimes surprises me that it is more intelligently capable than GPT 5.5

X AI KOLs Following · 2026-06-16 Cached

The author claims that Nemotron 3 Ultra is more intelligently capable than GPT 5.5.

0 favorites 0 likes
#nemotron

@ziv_ravid: 1/I read the Nemotron 3 Ultra report and it's interesting to compare their post-training to DeepSeek V4's. Both now do …

X AI KOLs Timeline · 2026-06-15 Cached

The tweet compares the post-training methods of Nemotron 3 Ultra and DeepSeek V4, noting both use multiple specialist teachers and on-policy distillation into a single student, but differ in support overlap.

0 favorites 0 likes
#nemotron

Nemotron - King of the Deep? Comparison of 4 models <=120B

Reddit r/LocalLLaMA · 2026-06-14

Comparison of four large language models (≤120B parameters) on deep context performance using Strix Halo hardware. Nemotron Super excels in prompt processing speed at deep context depths compared to GPT-OSS and Qwen models.

0 favorites 0 likes
#nemotron

@PrajwalTomar_: Nous Research and NVIDIA just converged on the same idea. Not a coding tool. Not a copilot. An agent that lives on your…

X AI KOLs Timeline · 2026-06-10 Cached

Nous Research and NVIDIA have independently converged on the same architecture for persistent AI agents that live on servers and improve daily, marking a shift from coding copilots to autonomous server-side agents.

0 favorites 0 likes
#nemotron

@llm_wizard: btw, we publish everything you need to build our Nemotron models including the recipes and pipelines directly. https://…

X AI KOLs Following · 2026-06-09 Cached

NVIDIA released the Nemotron repository with open training recipes, pipelines, and model weights for their Nemotron models, including the new Nemotron 3 Ultra and Nemotron 3 Nano Omni, supporting agentic AI and multimodal capabilities.

0 favorites 0 likes
#nemotron

@auroter: Frontier AI is BRAINDEAD. GPT5.5 xHigh in Codex thinks I should use Tensor Parallelism to deploy Qwen 3.6 27B on my sys…

X AI KOLs Following · 2026-06-08 Cached

The author criticizes Frontier AI (GPT5.5 xHigh) for incorrectly suggesting Tensor Parallelism for a model that fits on a single GPU, and announces a planned shootout comparing several AI models (GPT5.5, Opus 4.8, Qwen variants, Nemotron) on a real-world problem.

0 favorites 0 likes
#nemotron

Free Top Tier Models for OpenClaw

Reddit r/openclaw · 2026-06-08

A user shares that NVIDIA is currently offering top-tier AI models like Nemotron Ultra, DS4flash, Kimi, GLM, and Minimax3 for free with rate limiting, potentially benefiting personal users.

0 favorites 0 likes
#nemotron

@cyrilXBT: Nemotron 3 Ultra versus DeepSeek V4 versus MiniMax M3 versus Qwen 3.7 Max. Same two prompts. Four frontier models. One …

X AI KOLs Following · 2026-06-06 Cached

A comparison of four frontier AI models (Nemotron 3 Ultra, DeepSeek V4, MiniMax M3, Qwen 3.7 Max) on the same two prompts, with full results linked.

0 favorites 0 likes
#nemotron

@0xSero: Nvidia’s Nemotron series is the most open source series of models. I found: - benchmark asks - all the GitHub repos - a…

X AI KOLs Following · 2026-06-04 Cached

Nvidia's Nemotron series of AI models is fully open source, with benchmarks, GitHub repos, data, and weights available, performing competitively with NVFP4 benchmarks only 1% away.

0 favorites 0 likes
#nemotron

@rasbt: And another open-weight release. Nemotron 3 Ultra has an ultra impressive capability:efficiency ratio! Design-wise, it …

X AI KOLs Timeline · 2026-06-04 Cached

Nemotron 3 Ultra is an open-weight release with an impressive capability-to-efficiency ratio, using a Mamba-2-attention hybrid stack and LatentMoE, and is larger than the previous Super variant.

0 favorites 0 likes
#nemotron

NVIDIA Nemotron 3 Ultra is out.

Reddit r/LocalLLaMA · 2026-06-04

NVIDIA has released Nemotron 3 Ultra, a new model designed to power faster and more efficient reasoning for long-running AI agents.

0 favorites 0 likes
#nemotron

@kwindla: https://x.com/kwindla/status/2062544580105359686

X AI KOLs Timeline · 2026-06-04 Cached

NVIDIA released Nemotron 3.5 ASR, an open-source multilingual speech-to-text model with the lowest latency tested, available in multilingual and English-only variants, ideal for voice agents and self-hosted deployments.

0 favorites 0 likes
#nemotron

@mervenoyann: NVIDIA Nemotron Ultra is here > 55B/550B a hybrid MoE  with 1M context window > supports MTP speculative decoding > da…

X AI KOLs Following · 2026-06-04 Cached

NVIDIA released Nemotron Ultra, a hybrid MoE model with 55B/550B parameters and a 1M context window, supporting MTP speculative decoding and available day-0 in transformers.

0 favorites 0 likes
#nemotron

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4

Hugging Face Models Trending · 2026-06-03 Cached

NVIDIA releases Nemotron-3-Ultra, a 550B-parameter open-weight model with a hybrid architecture combining Mamba-2, MoE, and attention, supporting up to 1M token context and configurable reasoning mode.

0 favorites 0 likes
#nemotron

@FinanceYF5: Source:

X AI KOLs Following · 2026-06-02 Cached

NVIDIA announces the upcoming release of Nemotron 3 Ultra this week.

0 favorites 0 likes
#nemotron

@TheAhmadOsman: My pal Jensen is delivering Frontier Opensource Intelligence (that is extremely cost efficient) just like he said he wo…

X AI KOLs Following · 2026-06-01 Cached

Jensen Huang hints at more Nemotron model releases, highlighting open-source frontier intelligence and cost efficiency enabled by NVFP4 training.

0 favorites 0 likes
#nemotron

NVIDIA announces Nemotron 3 Ultra

Reddit r/LocalLLaMA · 2026-06-01

NVIDIA announces the Nemotron 3 Ultra AI model.

0 favorites 0 likes
#nemotron

How to Automate AI Model Documentation with the NVIDIA MCG Toolkit (8 minute read)

TLDR AI · 2026-06-01 Cached

NVIDIA introduces the MCG Toolkit, an automated pipeline that generates compliant model documentation (Model Card++ format) from source code in under a minute, leveraging RAG and NIM microservices.

0 favorites 0 likes
#nemotron

Embeddings for NVIDIA's Nemotron Personas

Reddit r/LocalLLaMA · 2026-05-23

Precomputed embedding vectors for the Nemotron-Personas dataset using Qwen 0.6B, enabling semantic search and clustering of synthetic personas via a web demo.

0 favorites 0 likes
Next →
← Back to home

Submit Feedback