nvidia

#nvidia

SOLAR: AI-Powered Speed-of-Light Performance Analysis

arXiv cs.LG ↗ · 5h ago Cached

SOLAR is a framework that automatically derives validated speed-of-light performance bounds from PyTorch and JAX source code using an LLM frontend and deterministic analysis, enabling headroom analysis and optimization insights for deep learning workloads.

Nemotron-TwoTower: Diffusion Language Modeling with Pretrained Autoregressive Context

arXiv cs.CL ↗ · 5h ago Cached

The paper proposes Nemotron-TwoTower, a diffusion language model that decouples context representation from denoising by pairing a frozen autoregressive tower with a trainable diffusion denoiser, retaining 98.7% of baseline quality at 2.42× the throughput.

@KevinNaughtonJr: this is the kind of engineering culture that all tech companies should strive for

X AI KOLs Following ↗ · 15h ago Cached

A tweet shares an anecdote about NVIDIA's engineering culture, where the lack of layoffs fosters collaboration instead of internal competition.

@timseyde: Dumbo's first steps — LFM2.5-230M doing multi-step tool-calling over pre-trained skills provided by @nvidia SONIC. Same…

X AI KOLs Following ↗ · 19h ago Cached

Liquid AI's LFM2.5-230M model demonstrates multi-step tool-calling capabilities on a Unitree G1 robot, running entirely on-device on an NVIDIA Jetson Orin, acting as a skill-selection layer.

The Ultimate Summer Sale Pairing: Steam Sale Meets GeForce NOW Discounts

NVIDIA Blog ↗ · 20h ago Cached

NVIDIA announces GeForce NOW summer sale discounts and new game additions to the cloud gaming library during the Steam Summer Sale, highlighting the benefits of cloud gaming.

NVIDIA has released Nemotron-TwoTower-30B-A3B-Base-BF16, an unusual diffusion-based language model built from the Nemotron 3 Nano 30B-A3B backbone.

Reddit r/LocalLLaMA ↗ · yesterday Cached

NVIDIA released Nemotron-TwoTower-30B-A3B-Base-BF16, a diffusion-based language model that uses block-wise autoregressive diffusion to generate text by iterative denoising of token blocks, achieving 2.42× the generation throughput of the autoregressive baseline while retaining 98.7% of benchmark quality.

If LLMs are so good at coding…

Reddit r/LocalLLaMA ↗ · yesterday

A discussion questioning why LLMs haven't helped ROCm and Intel's software ecosystems catch up to CUDA, highlighting NVIDIA's premium pricing and the need for genuine market competition.

Got GLM-5.2 + MTP speculative decode running on 4× DGX Spark (GB10) — and the build piece the public recipe is missing

Reddit r/LocalLLaMA ↗ · yesterday

The author ran GLM-5.2 with MTP speculative decoding on a 4× DGX Spark (GB10) setup and describes a build step that the public recipe is missing.

Nvidia's AI Chips Double in Price in China as It Tackles AI's Water Problem

Reddit r/ArtificialInteligence ↗ · yesterday Cached

Nvidia's AI chips are selling at record-high prices in China because of US export restrictions, while the company has also announced a new liquid-cooling system to reduce data center water usage.

Is AI 'one big bubble'? Behind the tech sell-off

Reddit r/artificial ↗ · 2d ago Cached

The article discusses a sell-off in AI-related tech stocks, raising doubts about whether the massive spending on artificial intelligence will yield returns. It highlights market volatility, with major companies like Micron, Nvidia, and Alphabet experiencing significant drops.

NVIDIA and AWS Collaborate to Bring AI to Production at Scale

NVIDIA Blog ↗ · 2d ago Cached

NVIDIA and AWS announce new EC2 G7 instances with NVIDIA RTX PRO 4500 Blackwell GPUs and GPU-accelerated vector search in Amazon OpenSearch Serverless, enabling enterprises to deploy AI at production scale with improved performance and reduced operational complexity.

@charles_irl: dflash go brr

X AI KOLs Timeline ↗ · 2d ago Cached

NVIDIA announces DFlash, an open-source block diffusion model for speculative decoding that achieves up to 15x higher inference throughput on Blackwell GPUs while maintaining interactivity.

NVIDIA's new chips just proved AI "safety" was always theater. We are not ready for 2029.

Reddit r/ArtificialInteligence ↗ · 2d ago

The post argues that NVIDIA's new chips make it possible to run 500B-parameter models locally, and that AI safety measures are merely behavioral speed bumps that vanish offline, posing unprecedented risks of deception and manipulation at scale.

AI Bubble about to Burst? Nvidia quietly acquihires Essential AI team, including Transformer coauthor Ashish Vaswani. Vaswani was struggling to raise money for his AI company.

Reddit r/ArtificialInteligence ↗ · 2d ago

Nvidia has quietly acquihired the team from Essential AI, including Transformer paper coauthor Ashish Vaswani, who was struggling to raise funds for his startup. Vaswani will work on Nvidia's Nemotron open-source models.

@arcinstitute: The future of biology is agentic. We're proud to work with NVIDIA on the Evo series of models and are excited to see th…

X AI KOLs Following ↗ · 2d ago Cached

NVIDIA launches the BioNeMo Agent Toolkit, an open toolkit that enables AI agents to perform tasks like protein structure prediction, molecular docking, and generative chemistry, accelerating programmable biology in collaboration with Arc Institute.

I'm eager for a 15x speedup on my strix halo

Reddit r/LocalLLaMA ↗ · 2d ago

Nvidia claims a 15x speedup in text generation from a diffusion model that generates entire blocks at once.

xAI posted 65 jobs for biology, physics, and chemistry tutors. Tracked hiring data across 8 AI labs to see what each one's actually building toward

Reddit r/ArtificialInteligence ↗ · 2d ago

The post analyzes hiring data across eight AI labs to infer their strategic directions, noting xAI's focus on scientific tutors, Nvidia's data center push, and OpenAI's engineering growth.

UPDATE: Qwen-27B-IQ4_KS and Qwen-27B-IQ_KS_KT for ik_llama.cpp, especially for NVIDIA with 16GB VRAM

Reddit r/LocalLLaMA ↗ · 2d ago

The post shares new GGUF quantizations of Qwen3.6-27B optimized for NVIDIA GPUs with 16GB of VRAM, including an experimental Trellis variant, along with perplexity benchmarks.

@aijoey: for all my new dgx spark owners. https://github.com/joeynyc/spark-doctor…

X AI KOLs Timeline ↗ · 2d ago Cached

Spark Doctor is an open-source diagnostic CLI for NVIDIA DGX Spark that collects system, GPU, memory, Docker, and recipe data, applies specific rules, and outputs the likely cause and next steps for common issues.

@PyTorch: While SGLang provided Day-0 support for DeepSeek-V4, the collaboration between the @lmsysorg and @NVIDIAAI engineering …

X AI KOLs Following ↗ · 2d ago Cached

SGLang provided Day-0 support for DeepSeek-V4, and collaboration between the LMSys and NVIDIA engineering teams achieved up to a 5x throughput increase in production, with the improvements shown on the SemiAnalysis InferenceX dashboard.
