open-weights

Tag

Cards List
#open-weights

GLM-5.2 matched Claude Opus on 45 terminal-bench coding-agent tasks at less than half the cost (full methodology + failure transcripts inside)

Reddit r/ArtificialInteligence · 9h ago

GLM-5.2 matches Claude Opus on 45 coding-agent tasks at lower cost, with 43 of 45 tasks having identical outcomes.

0 favorites 0 likes
#open-weights

Human Evaluation of GLM-5.2

Reddit r/LocalLLaMA · yesterday

The author praises GLM-5.2, an MIT open-weights model, for its exceptional real-world performance in human evaluation benchmarks, claiming it rivals the best closed-source models like those from Claude.

0 favorites 0 likes
#open-weights

GLM 5.2 vs. Opus

Hacker News Top · 2d ago Cached

GLM 5.2 is a new open-weights model from Z.ai, compared against Claude Opus in a 3D game coding task. Opus performed faster and cleaner, but GLM 5.2 offers compelling cost and accessibility advantages.

0 favorites 0 likes
#open-weights

@losterror501: with 2dgx sparks getting 25tok/sec with 1 session and it peaks to 152tok/sec with 8 sessions. Actually insane...

X AI KOLs Timeline · 3d ago Cached

Announcement of Qwable-v1, an open-weights model distilled from Claude Fable-5, along with performance benchmarks on 2dgx sparks hardware achieving 25 tok/sec (single session) and 152 tok/sec (8 sessions).

0 favorites 0 likes
#open-weights

I released a softmax-free attention model at GPT-2 Medium scale (~354M params, 11.5B tokens): structural sparsity + tile-skipping kernels for long-context VRAM savings. Open weights + custom Triton kernels [R]

Reddit r/MachineLearning · 3d ago Cached

Released RRT-355M, a softmax-free attention model at GPT-2 Medium scale with 354M parameters trained from scratch on 11.5B tokens, using structural sparsity and tile-skipping kernels for long-context efficiency, achieving comparable performance to GPT-2 Medium on a 22-task benchmark.

0 favorites 0 likes
#open-weights

@browser_use: Open-weights models have officially caught up We tried GLM 5.2 in BrowserCode > Near Opus-level score > Cheapest model …

X AI KOLs Following · 5d ago Cached

Open-weights models have caught up with proprietary ones, with GLM 5.2 achieving near Opus-level scores in browser agent tasks at low cost. Other models like Minimax M3 and Kimi k2.7 also show notable improvements.

0 favorites 0 likes
#open-weights

Toward Open Weight Models Without Risks: Separating Public and Private Capabilities in LLMs

Hugging Face Daily Papers · 6d ago Cached

This paper introduces Tiered Language Models (TLMs), which allow a single set of open-weight model parameters to support multiple capability levels controlled by secret keys. The method enables selective exposure of private capabilities while preserving public model behavior and resisting extraction.

0 favorites 0 likes
#open-weights

Mistral AI to get Code and Apps features on Vibe (2 minute read)

TLDR AI · 6d ago Cached

Mistral AI is adding dedicated Code and Apps sections to its Vibe (Le Chat) web platform, turning it from a conversational interface into a development and app-building environment. A new large, sparse mixture-of-experts model is also confirmed for summer release as open weights.

0 favorites 0 likes
#open-weights

Updates on North Mini Code: 4 bit quant + Ollama + OpenRouter

Reddit r/LocalLLaMA · 6d ago Cached

Cohere releases North Mini Code, a 30B-A3B open-weights model with 4-bit quantization for code generation and agentic coding tasks, supporting 256K context.

0 favorites 0 likes
#open-weights

@MaximeRivest: glm 5.2 is good (enough) and this is important. glm 5.2 is good enough to change information technology in very fundame…

X AI KOLs Following · 6d ago Cached

GLM 5.2 is an open-weights LLM that is sufficiently capable to allow businesses to manage their IT needs locally on affordable hardware, potentially transforming small/medium enterprise data management.

0 favorites 0 likes
#open-weights

@totheagi: We're the first to make the full GLM-5.2 (FP8) run on RTX 4090s. GLM-5.2 is the new 753B SOTA open-weights model, and i…

X AI KOLs Timeline · 6d ago Cached

We're the first to run the full GLM-5.2 (753B FP8) on RTX 4090s by porting sparse-attention kernels to Ada GPUs, enabling frontier open-weights model on commodity hardware.

0 favorites 0 likes
#open-weights

@googledevs: Autonomous AI in action. Check out how the new Gemma 4 31B model operates as an ADK Agent, exploring, planning, and run…

X AI KOLs Following · 2026-06-18 Cached

Google DeepMind released the Gemma 4 series of open-weight models, covering four sizes from 2B to 31B, supporting 128K–256K context, reasoning, and function calling, under Apache 2.0 license, and equipped with ADK framework for autonomous agent capabilities.

0 favorites 0 likes
#open-weights

GLM-5.2 is probably the most powerful text-only open weights LLM

Simon Willison's Blog · 2026-06-17 Cached

Chinese AI lab Z.ai released GLM-5.2, a 753B parameter open weights LLM with a 1M token context window under MIT license, achieving top scores on the Artificial Analysis Intelligence Index and ranking second on the Code Arena WebDev leaderboard.

0 favorites 0 likes
#open-weights

@heyshrutimishra: Apodex 1.0 dropped and the architecture is genuinely different. It's post-trained on Qwen3.5 as a self-evolving system:…

X AI KOLs Following · 2026-06-17 Cached

Apodex 1.0 is a self-evolving AI system post-trained on Qwen3.5, achieving SOTA on BrowseComp, DeepSearchQA, and HLE-text. Its 4B mini model outperforms 30B-class models, with an AgentOS runtime for task orchestration. Open weights available.

0 favorites 0 likes
#open-weights

GLM-5.2 is the new leading open weights model on Artificial Analysis

Hacker News Top · 2026-06-17 Cached

Z ai's GLM-5.2 has become the new leading open weights model on the Artificial Analysis Intelligence Index, scoring 51 and outperforming competitors like MiniMax-M3 and DeepSeek V4 Pro. The model features 744B total parameters, 40B active, MIT license, and 1M context window.

0 favorites 0 likes
#open-weights

Kimi K2.7 Code: 1T MoE, $0.95/M tokens, MIT license, beats Opus 4.8 on MCP tool-calling

Reddit r/AI_Agents · 2026-06-17

Moonshot AI 发布了专注于编程的开放式权重模型 Kimi K2.7 Code,拥有1万亿参数和384个专家,性能在MCP工具调用上超越Opus 4.8,成本仅为十分之一。

0 favorites 0 likes
#open-weights

@xeophon: 51 is the same score as GPT-5.4 xhigh, btw. The model was released 3 months ago and frontier at the time

X AI KOLs Following · 2026-06-17 Cached

Z ai's GLM-5.2 open weights model scores 51 on the Artificial Analysis Intelligence Index, matching GPT-5.4 xhigh and sitting on the Pareto frontier of intelligence vs cost per task.

0 favorites 0 likes
#open-weights

@cline: Step 3.7 Flash is free in Cline for the next month. It beats Gemini and DeepSeek flash models, and comes surprisingly c…

X AI KOLs Following · 2026-06-17 Cached

Step 3.7 Flash, an open-weights model with a 256k context window, is available free in Cline for a month, claiming to outperform Gemini and DeepSeek flash models and approach frontier performance on SWE Bench.

0 favorites 0 likes
#open-weights

@atomic_chat_hq: New @Zai_org GLM-5.2 beats Kimi K2.7 Code on physics contest! We gave both models the same three prompts and asked them…

X AI KOLs Following · 2026-06-17 Cached

Z.ai releases GLM-5.2, an open-weights AI model with improved coding and agentic performance, demonstrated by beating Kimi K2.7 Code on a physics simulation benchmark across three tasks.

0 favorites 0 likes
#open-weights

GLM-5.2 just dropped open weights and it already looks weirdly strong for coding

Reddit r/LocalLLaMA · 2026-06-16

GLM-5.2 has been released with open weights under MIT license, featuring a 1M context window and two reasoning effort modes. Early benchmarks show it performing strongly in coding tasks, making it worth testing beyond benchmark screenshots.

0 favorites 0 likes
Next →
← Back to home

Submit Feedback