open-weights

#open-weights

GLM 5.2 API is live, weights are on HF, and ollama has it already

Reddit r/LocalLLaMA ↗ · 2026-06-16

GLM 5.2 has been released with open weights under MIT license on HuggingFace, available via API and Ollama, featuring competitive benchmarks that trail Opus 4.8 by a point and edge GPT-5.5 by one.

0 favorites 0 likes

#open-weights

GLM-5.2 is the first open-weights model to cross 80% on Terminal-Bench and beats every other open model available

Reddit r/LocalLLaMA ↗ · 2026-06-16

GLM-5.2 is the first open-weights model to exceed 80% on Terminal-Bench, surpassing all other open models and even Gemini, making it a frontier-level model at a fraction of the cost.

0 favorites 0 likes

#open-weights

Claude Fable 5 distilled

Reddit r/LocalLLaMA ↗ · 2026-06-16 Cached

Qwable-v1 is an open-weights agentic coding model (35B MoE, 3B active) built by chaining distills from Claude Opus 4.7 reasoning and Claude Fable-5 agentic tool-use traces. It can think in explicit CoT chains and act as a Claude-Code-style agent when prompted.

0 favorites 0 likes

#open-weights

Why are Huawei's Atlas cards not a thing?

Reddit r/LocalLLaMA ↗ · 2026-06-15

A user questions why Huawei's Atlas cards are not widely adopted and speculates on China's potential to produce consumer GPUs to challenge Nvidia's monopoly.

0 favorites 0 likes

#open-weights

z.ai Poll on X: MIT-licensed open weights are losing

Reddit r/LocalLLaMA ↗ · 2026-06-14

A poll on X shows MIT-licensed open weights are losing with 7 hours left and 1,800 votes cast.

0 favorites 0 likes

#open-weights

Local models in mid-2026

Reddit r/LocalLLaMA ↗ · 2026-06-14 Cached

A technical overview of the state of local AI models in mid-2026, highlighting how open-weight models have narrowed the gap to frontier models through advances in mixture-of-experts and sparse attention, enabling efficient local inference.

0 favorites 0 likes

#open-weights

@awnihannun: The video from @angeloskath on local agentic AI with MLX is excellent. I also hear it's one of the most viewed videos i…

X AI KOLs Following ↗ · 2026-06-12 Cached

A tweet highlights an excellent WWDC video by Angelos Kath on building local agentic AI with MLX, noting rapid progress in open-weight models and hardware capabilities.

0 favorites 0 likes

#open-weights

Cheaper, faster, and culturally aware, Avataar’s video AI is built for India’s scale

TechCrunch AI ↗ · 2026-06-12 Cached

Avataar AI launches Varya, a video generation model optimized for India's scale and cultural context, using distillation from Wan 2.2 to achieve 20x cost reduction and local nuance understanding.

0 favorites 0 likes

#open-weights

Minimax M3 open weights release planned for Friday

Reddit r/LocalLLaMA ↗ · 2026-06-11 Cached

MiniMaxAI announces plans to release open weights for its upcoming M3 model on Friday, following the earlier M2.7 model.

0 favorites 0 likes

#open-weights

DiffusionGemma

Simon Willison's Blog ↗ · 2026-06-10 Cached

Google released DiffusionGemma, an open-weight text generation model (26B parameters, 4B active) under Apache 2 license, demonstrating high inference speeds via NVIDIA's NIM cloud API.

0 favorites 0 likes

#open-weights

@Modular: Our kernel team has been deep in MiniMax M3 all week. The 1M-token context and native multimodality make it a hard mode…

X AI KOLs Following ↗ · 2026-06-09 Cached

Modular's kernel team is optimizing serving for MiniMax M3's 1M-token context and native multimodality, with open weights dropping soon for immediate deployment on Modular.

0 favorites 0 likes

#open-weights

@danveloper: https://x.com/danveloper/status/2064387956387758206

X AI KOLs Timeline ↗ · 2026-06-09 Cached

A developer ran DeepSeek-V4-Flash on a Raspberry Pi 5 by streaming model weights from an NVMe SSD, achieving 1.3 tokens/second at 8 watts, demonstrating the feasibility of frontier-adjacent open-weight models on low-cost, offline hardware.

0 favorites 0 likes

#open-weights

Our ICML paper on predictable hallucination (information-budget abstention gate), + ntkMirror: a training-free open-weight implementation we're releasing today

Reddit r/LocalLLaMA ↗ · 2026-06-09

A paper accepted at ICML 2026 introduces predictable hallucination via an information-budget abstention gate, and releases ntkMirror, a training-free open-weight implementation that reduces hallucination by abstaining when information is insufficient, achieving 0.0–0.7% hallucination at ~24% abstention.

0 favorites 0 likes

#open-weights

@cohere: We encourage developers to share their builds with us and give feedback to shape future iterations. Let’s shape the fut…

X AI KOLs Following ↗ · 2026-06-09 Cached

Cohere and Cohere Labs released North Mini Code, an open weights 30B-A3B parameter model optimized for code generation, agentic software engineering, and terminal tasks, with strong benchmark results on SWE-Bench and Terminal-Bench.

0 favorites 0 likes

#open-weights

I fine-tuned Parakeet 0.6B for medical ASR — open weights, local Mac/CUDA/CPU

Reddit r/LocalLLaMA ↗ · 2026-06-09

Omi Health founder fine-tuned NVIDIA's Parakeet TDT 0.6B for medical ASR, releasing open-weights model Omi Med STT v1 that achieves competitive medical-WER while running locally on Mac, CUDA, or CPU.

0 favorites 0 likes

#open-weights

Was BitNet a dead end? What happened to ternary LLMs?

Reddit r/LocalLLaMA ↗ · 2026-06-08

The article questions why ternary language models like BitNet have not scaled beyond 2B parameters, given their initial promise, and discusses the apparent lack of progress from open-weight AI labs.

0 favorites 0 likes

#open-weights

@victormustar: Before the week ends, let's acknowledge one of the most INSANE week ever for open AI, with 25+ notable open-weight drop…

X AI KOLs Following ↗ · 2026-06-05 Cached

A recap of an extraordinary week in open AI, featuring over 25 open-weight model releases across LLMs, image generation, audio/speech, vision, and video/3D, with notable contributions from NVIDIA, Google, and others.

0 favorites 0 likes

#open-weights

CohereLabs/North-Mini-Code-1.0

Hugging Face Models Trending ↗ · 2026-06-05 Cached

Cohere Labs released North Mini Code, a 30B-parameter (3B active) open-weights model optimized for code generation, agentic software engineering, and terminal tasks, licensed under Apache 2.0.

0 favorites 0 likes

#open-weights

google/gemma-4-12B-it-qat-q4_0-gguf

Hugging Face Models Trending ↗ · 2026-06-05 Cached

Google DeepMind releases Gemma 4 models optimized with Quantization-Aware Training (QAT) in multiple formats including GGUF, enabling high quality with reduced memory requirements.

0 favorites 0 likes

#open-weights

@MaximeRivest: nemotron 550b ultra from nvidia just dropped! it's tool calling and standard system prompt is very very very clean and …

X AI KOLs Following ↗ · 2026-06-04

NVIDIA released Nemotron 550B Ultra, a large language model featuring a clean XML-based tool calling interface instead of JSON schemas, with tool results delivered as user messages in XML tags.

0 favorites 0 likes

open-weights

Submit Feedback