open-weights

#open-weights

@MaximeRivest: nemotron 550b ultra from nvidia just dropped! its tool calling and standard system prompt is very very very clean and …

X AI KOLs Following ↗ · 2026-06-04

NVIDIA released Nemotron 550B Ultra, a large language model with a clean XML-based tool-calling interface in place of JSON schemas; tool results are delivered as user messages wrapped in XML tags.

#open-weights

@PrajwalTomar_: Everyone's sleeping on MiniMax. Again. They just shipped M3. The first open-weights model to combine frontier coding, 1…

X AI KOLs Following ↗ · 2026-06-04 Cached

MiniMax released M3, an open-weights model combining frontier coding, 1M context, and native multimodality, offering comparable performance to Opus at a fraction of the cost.

#open-weights

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-BF16 · Hugging Face

Reddit r/LocalLLaMA ↗ · 2026-06-04

NVIDIA releases Nemotron-3-Ultra-550B-A55B, a 550B parameter (55B active) frontier LLM featuring a hybrid LatentMoE architecture combining Mamba-2, MoE, and Attention layers, with up to 1M token context length and configurable reasoning mode. It supports 11 languages and is optimized for complex agentic workflows, long-context analysis, and high-accuracy reasoning.

#open-weights

Gemma 4 12B multimodal model matches larger models without encoder

Reddit r/singularity ↗ · 2026-06-04

Google's Gemma 4 12B introduces an encoder-free multimodal architecture that competes with larger models, though benchmark comparisons show it trailing Qwen 2.5 9B on most tasks. The article also covers related developments including open-weight model security risks, Uber's Claude Code spending caps, and NeurIPS's misuse of an uncalibrated AI detector.

#open-weights

@svpino: Humans have an average of 200-250 ms of latency when speaking to each other. This voice model is even faster: only 110 …

X AI KOLs Following ↗ · 2026-06-03

An open-weights 8B parameter voice model achieves only 110ms latency, faster than average human conversation latency of 200-250ms. It can be run locally and is freely available via a GitHub repository.

#open-weights

@HuggingPapers: Google just released Magenta RealTime 2 on Hugging Face The only open-weights model for real-time continuous music gene…

X AI KOLs Following ↗ · 2026-06-03 Cached

Google released Magenta RealTime 2 on Hugging Face, an open-weights model for real-time continuous music generation on device with ~200ms latency, steerable by text, audio, or MIDI.

#open-weights

nvidia/NVIDIA-Nemotron-3-Ultra-550B-A55B-NVFP4

Hugging Face Models Trending ↗ · 2026-06-03 Cached

NVIDIA releases Nemotron-3-Ultra, a 550B-parameter open-weight model with a hybrid architecture combining Mamba-2, MoE, and attention, supporting up to 1M token context and configurable reasoning mode.

#open-weights

Microsoft Aion 1.0 Instruct and Aion 1.0 Plan models!

Reddit r/LocalLLaMA ↗ · 2026-06-03

Microsoft announced two new on-device AI models at Build 2026: Aion 1.0 Instruct, an open-weights small language model, and Aion 1.0 Plan, a 14B parameter reasoning and tool-calling model for local agentic workflows.

#open-weights

@swyx: roundup of links:

X AI KOLs Following ↗ · 2026-06-02 Cached

NVIDIA releases Cosmos 3 (Mixture-of-Transformers models up to 64B), Nemotron 3 Ultra (550B-A55B LLM), and previews RTX Spark personal superchip at Computex 2026, achieving SOTA on multiple open model leaderboards.

#open-weights

I just created a detailed report based on the DeepSWE benchmark data

Reddit r/singularity ↗ · 2026-06-01

An analysis of the DeepSWE benchmark data reveals surprising cost and performance differences among models: GPT 5.5 leads in both capability and cost efficiency, while open-weights models can be expensive per pass.

#open-weights

@RyanLeeMiniMax: MiniMax-M3 will arrive on HuggingFace as open weights next week!

X AI KOLs Following ↗ · 2026-06-01 Cached

MiniMax announced MiniMax-M3, an open-weights model combining frontier coding and agentic capabilities with sparse attention scaling to 1M context, set to arrive on HuggingFace next week.

#open-weights

@MiniMax_AI: Introducing MiniMax M3: The First Open-Weights Model to Combine Three Frontier Capabilities - Coding & Agentic Frontier…

X AI KOLs Timeline ↗ · 2026-06-01 Cached

MiniMax unveils MiniMax M3, the first open-weights AI model combining frontier capabilities in coding and agentic tasks, achieving strong benchmark scores with sparse attention scaling to 1M context.

#open-weights

MiniMax M3 (2 minute read)

TLDR AI ↗ · 2026-06-01 Cached

MiniMax introduces M3, the first open-weights model to combine coding, agentic, and multimodal capabilities with up to 1M context via sparse attention.

#open-weights

Safety guardrails continue to improve, but what happens if open-weights surpass cloud based models?

Reddit r/artificial ↗ · 2026-05-31

The article explores the implications of open-weight models potentially surpassing cloud-based models in performance, while noting that safety guardrails are improving.

#open-weights

1-Bit Bonsai Image 4B Image Generation for Local Devices

Hacker News Top ↗ · 2026-05-31 Cached

PrismML releases Bonsai Image 4B, a family of compact image generation models using 1-bit and ternary weights, enabling high-quality diffusion inference on local devices like laptops and iPhones with significantly reduced memory footprint.

#open-weights

Open-weights VLA hits 80%+ task progress on 4 of 17 real-robot tasks with zero fine-tuning. Demo reel attached

Reddit r/singularity ↗ · 2026-05-31

Release of Wall-OSS-0.5, an open-weights vision-language-action model that achieves over 80% task progress on 4 of 17 real-robot tasks with zero fine-tuning, including on a deformable rope task not seen during pretraining. The model preserves general vision-language ability while improving embodied grounding.

#open-weights

G7 agrees on shared language around open-source AI, open weights AI

Reddit r/artificial ↗ · 2026-05-30 Cached

The G7 Digital and Technology Ministers reached a consensus on shared terminology for open-source and open-weights AI, defining categories like Open Source AI with Open Data, Open Source AI, Open Weights AI, and Weights Available AI to standardize discussions around AI openness.

#open-weights

ideogram-ai/ideogram-4-nf4

Hugging Face Models Trending ↗ · 2026-05-30 Cached

Ideogram has released Ideogram 4, their first open-weight text-to-image model trained from scratch, featuring state-of-the-art multilingual text rendering, JSON-structured prompting, bounding-box layout controls, and native 2K resolution output. The NF4-quantized version is available on Hugging Face, with the model claimed to be the best open-weight image model and competitive with proprietary frontier models.

#open-weights

@LangChain: The latest finding in the LangSmith Signal: Open Models are having a moment. 1 in 3 AI teams ran an open-weights model …

X AI KOLs Timeline ↗ · 2026-05-29 Cached

LangSmith Signal reports that 1 in 3 AI teams now run open-weights models, up from 1 in 5 nine months ago, with overall usage growing 3x.

#open-weights

Step 3.7 Flash open weights dropped TODAY and the agent reliability numbers are actually interesting

Reddit r/artificial ↗ · 2026-05-29

Step 3.7 Flash, an open-weight 198B sparse MoE model, claims 98% agent reliability on tau2-bench across all difficulty levels, pairing middling raw capability with strong multi-step consistency.
