nemotron

#nemotron

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

Hugging Face Blog ↗ · 2026-05-23 Cached

NVIDIA introduces Nemotron-Labs Diffusion, a family of diffusion language models that generate text in parallel and iteratively refine it, offering faster generation and the ability to revise previous tokens.

0 favorites 0 likes

#nemotron

nvidia/nemotron-3.5-asr-streaming-0.6b

Hugging Face Models Trending ↗ · 2026-05-15 Cached

NVIDIA releases Nemotron 3.5 ASR, a 600M parameter multilingual streaming speech recognition model supporting 40 language-locales with a Cache-Aware FastConformer-RNNT architecture for low-latency transcription. The model supports configurable chunk sizes and is ready for commercial use under the OpenMDW-1.1 license.

0 favorites 0 likes

#nemotron

@ctnzr: We've gone even farther: Nemotron 3 Super is 120B and pretrained on 25T tokens in NVFP4. Nemotron 3 Ultra is ~500B and …

X AI KOLs Following ↗ · 2026-05-15 Cached

NVIDIA announces Nemotron 3 Super (120B) and Nemotron 3 Ultra (~500B) models, pretrained on 25T tokens using NVFP4 precision, emphasizing accelerated computing and efficiency improvements.

0 favorites 1 likes

#nemotron

MLX 16/8/4/2-bit quants of nvidia/llama-embed-nemotron-8b

Reddit r/LocalLLaMA ↗ · 2026-05-14

The user converted Nvidia's Llama-Embed-Nemotron-8B model to MLX format with fp16, 8-bit, 4-bit, and 2-bit quantizations, enabling in-process embedding loading on Apple Silicon via mlx-embeddings.

0 favorites 0 likes

#nemotron

@VincentLogic: NVIDIA really went all out this time, directly releasing an open-source video understanding monster Nemotron 3 Nano Omni that processes video at an insane speed: 1 hour to handle 10 hours of video content, 10 times faster than playback speed. The core relies on 3D convolution technology, no longer scanning frame by frame, but instead…

X AI KOLs Timeline ↗ · 2026-05-14

NVIDIA has open-sourced the video understanding model Nemotron 3 Nano Omni, which uses 3D convolution technology and processes video 10 times faster than playback speed. It excels at audio-video analysis, surveillance retrieval, and asset tagging, but is not suitable for code or text inference tasks.

0 favorites 0 likes

#nemotron

Nemotron Labs: What OpenClaw Agents Mean for Every Organization

NVIDIA Blog ↗ · 2026-04-30 Cached

OpenClaw, an open-source persistent AI assistant, has become the most-starred GitHub project, sparking debate over security and autonomy. NVIDIA is collaborating to enhance security and releasing NemoClaw as a secure reference implementation.

0 favorites 0 likes

#nemotron

NVIDIA Launches Nemotron 3 Nano Omni Model, Unifying Vision, Audio and Language for up to 9x More Efficient AI Agents

NVIDIA Blog ↗ · 2026-04-28 Cached

NVIDIA announces Nemotron 3 Nano Omni, an open multimodal model that unifies vision, audio, and language processing to enable faster and more efficient AI agents, achieving up to 9x higher throughput compared to other open omni models.

0 favorites 0 likes

#nemotron

nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16

Hugging Face Models Trending ↗ · 2026-04-20 Cached

NVIDIA releases Nemotron 3 Nano Omni, a 30B parameter multimodal model capable of processing video, audio, image, and text with integrated reasoning capabilities for enterprise workflows.

0 favorites 0 likes

#nemotron

Building a Fast Multilingual OCR Model with Synthetic Data

Hugging Face Blog ↗ · 2026-04-17 Cached

NVIDIA introduces Nemotron OCR v2, a fast multilingual OCR model built using synthetic data generation. The model achieves 34.7 pages/second on a single A100 GPU by using a unified FOTS-based architecture with feature reuse across detection, recognition, and relational components.

0 favorites 0 likes

nemotron

Submit Feedback