Tag
Krea 2 is a 12-billion parameter text-to-image diffusion model released open-weight on Hugging Face, with Raw (base) and Turbo (post-trained) checkpoints available.
A comprehensive 2026 guide to 30 powerful LLMs that are free to use, distinguishing between hosted platforms and self-hostable open-weight models, with detailed hardware requirements and license considerations.
GLM-5.2, an open-weight model with Opus-level design capabilities, incorporates an anti-hacking module trained via RL to mitigate reward hacking and improve performance on long-running tasks.
GLM-5.2 is an open-source long-context model with a solid 1M-token context, strong coding capabilities, and an MIT license, now available on Hugging Face.
Empero AI releases Qwythos-9B, a fine-tuned reasoning model with 1M token context and uncensored capabilities, showing large benchmark improvements over its Qwen3.5-9B base.
The author argues that governments and companies will increasingly switch to open-weight AI models to avoid US government control over access, noting that open models are only slightly behind closed ones and move at a pace more suited to real-world bureaucracy.
GLM-5.2 is an open weight AI model optimized for creative writing tasks, claimed to be the best in its category.
A community member proposes creating a crowdsourced coding dataset for local LLMs to enable collaborative model training and fine-tuning, addressing concerns about future availability of open-weight models.
Mistral announces a new family of open-weight models scheduled for release in July.
The article argues that relying on proprietary frontier AI APIs is risky due to unpredictable cost increases, availability changes, and lack of auditability, advocating for open-weight models as a more trustworthy alternative.
A new initiative called Trace Commons aims to collect coding agent traces into an open CC-BY-4.0 dataset to help train open-weight and open-source models, countering the data advantage of proprietary models from Anthropic and OpenAI.
A test of the open-weight MiniMax M3 model using MLX-VLM on a Mac Studio shows it can autonomously fill out a US customs form from a driver's license photo and a scanned document, using tool calls for fields, checkboxes, and signature.
Cohere released a new lightweight 30B open-weight model for agentic coding tasks, built on Command A+ with parallel transformer design, showing strong performance on agentic benchmarks like Terminal-Bench and SWE-Bench.
A benchmark comparison of local open-weight LLMs on a single H100 (FP8) shows DiffusionGemma is 4x faster but makes 6x more mistakes than Gemma4 26B A4B, highlighting trade-offs between speed and accuracy in diffusion versus autoregressive models.
Fable-5/Mythos achieves new SOTA on agentic search but is expensive for self-hosting, while open-weight Harness-1 offers a cost-effective alternative with fewer query restrictions.
The article argues that the harness (the system around the model) is as important as the model itself for agent performance, citing evidence from various benchmarks and experiments.
This paper introduces AI-MASLD, a stress-audit framework for medical LLMs that reveals how benchmark accuracy can hide serious safety failures, and demonstrates that open-weight models can match or exceed proprietary ones on safety dimensions.
SafeGene proposes a reusable safety-adapter module that decouples safety capabilities from task-specific updates, enabling efficient restoration of safety alignment in open-weight LLMs after downstream fine-tuning through few-shot recalibration.
Liquid AI released LFM2.5-VL-1.6B-Extract and LFM2.5-VL-450M-Extract, vision-language models that output structured JSON from images and field lists. The models are open-weight and available in two sizes.
Nemotron 3 Ultra is an open-weight release with an impressive capability-to-efficiency ratio, using a Mamba-2-attention hybrid stack and LatentMoE, and is larger than the previous Super variant.