Tag
Famed short seller Michael Burry has reportedly established approximately $1 billion in short positions betting on an AI bubble collapse, targeting primarily Palantir ($912M) and NVIDIA ($187M). This is his largest short play since the 2008 financial crisis.
This article explores the feasibility of using an external NVIDIA RTX 5090 GPU with an Apple Silicon Mac via Thunderbolt for CUDA inference and gaming, covering methods like tinygrad eGPU drivers and PCI passthrough to a Linux VM.
An opinion piece highlighting the thriving DGX Spark developer community that is collaboratively optimizing the hardware despite its limitations, with projects like Sparkrun and PrismaQuant.
NVIDIA utilized late interaction, a form of sparse attention, for an attention-based encoder-decoder to retrieve directly from internal representations.
cuda-oxide is an experimental Rust-to-CUDA compiler that allows developers to write safe, idiomatic Rust GPU kernels that compile directly to PTX.
cuda-oxide is an experimental Rust-to-CUDA compiler backend released by NVIDIA, enabling pure Rust GPU kernel development without foreign language bindings.
The article summarizes the current state of the AI data industry, pointing out that the data industry is not yet mature. Anthropic and OpenAI spend over $10 million on a single environment, while Chinese AI labs tend to build rather than buy. In addition, many labs have access to Huawei chips but still crave more Nvidia chips.
NVIDIA CEO Jensen Huang at the Milken Institute Global Conference discussed how open source AI serves as America's strongest tool for AI security, arguing that more open models means more defenders protecting AI systems.
US Energy Secretary Chris Wright and NVIDIA’s Ian Buck discuss the Genesis Mission, a DOE effort to apply AI to scientific discovery, at the SCSP AI+ Expo. They highlight NVIDIA's partnership in building AI supercomputers and the importance of energy for AI.
Rene Haas's Arm earnings call comments are interpreted as confirming the 'Vera CPU thesis,' suggesting a shift toward dedicated CPU orchestration for agentic AI workloads alongside NVIDIA's GPU infrastructure.
NVIDIA and Unsloth have published a technical guide detailing three low-level optimizations that can accelerate LLM fine-tuning by up to 25%, including packed-sequence caching, double-buffered checkpointing, and optimized MoE routing. The guide provides deep systems-level explanations and benchmarks aimed at ML engineers and developers.
NVIDIA’s GeForce NOW now supports Gaijin single sign-on, making it easier to log in to games like War Thunder across devices. The update also adds seven new games and extends RTX 5080 performance to Ultimate members.
NVIDIA's Spectrum-X Ethernet fabric with Multipath Reliable Connection (MRC) sets a new standard for gigascale AI networking, improving throughput, load balancing, and resilience for large-scale AI training, as demonstrated by OpenAI, Microsoft, and Oracle.
NVIDIA and ServiceNow announced a partnership to deliver autonomous AI agents for enterprises, including Project Arc, a self-evolving desktop agent with governance and security powered by NVIDIA accelerated computing and ServiceNow's AI platform.
OpenClaw, an open-source persistent AI assistant, has become the most-starred GitHub project, sparking debate over security and autonomy. NVIDIA is collaborating to enhance security and releasing NemoClaw as a secure reference implementation.
NVIDIA announces 16 games joining GeForce NOW cloud streaming in May, including new AAA titles like Forza Horizon 6 and 007 First Light, and expands RTX 5080-class performance across the library for Ultimate members.
NVIDIA announces Nemotron 3 Nano Omni, an open multimodal model that unifies vision, audio, and language processing to enable faster and more efficient AI agents, achieving up to 9x higher throughput compared to other open omni models.
NVIDIA releases Nemotron 3 Nano Omni, a new long-context multimodal AI model capable of processing documents, audio, video, and text with high accuracy and efficiency.
OpenAI's new GPT-5.5 frontier model now powers Codex, running on NVIDIA GB200 NVL72 systems, and NVIDIA employees are already using it with measurable gains in productivity and debugging speed.
GeForce NOW has launched new in-app labels to help users easily identify games available via Xbox Game Pass and Ubisoft+ subscriptions, alongside adding new titles and rewards.