Tag
GLM-5.2 is free via Hugging Face Inference Providers for the next 6 hours; the announcement encourages trying it with coding agents to showcase progress in open-source models.
This article analyzes the reasons behind the performance leap of Zhipu GLM-5.2, suggesting that its 40B active parameters provide greater effective capacity after accounting for fixed overhead, which makes RL post-training more effective. It also reviews the history of Chinese AI model development and notes that the large-model approach ultimately prevailed.
We're the first to run the full GLM-5.2 (753B FP8) on RTX 4090s by porting sparse-attention kernels to Ada GPUs, enabling a frontier open-weights model on commodity hardware.
UnslothAI announces that GLM-5.2, Z.ai's strongest open model with 744B parameters, can now run locally via dynamic GGUF quantization, which reduces its size by ~84% to 239GB while retaining ~82% accuracy. It fits on 256GB Macs and supports long-context, reasoning, and agentic tasks.
A detailed user review of GLM-5.2 accessed via API, praising its long-context coherence, adaptive reasoning, and frontier-level text performance comparable to GPT-5.5, while noting the lack of native vision and high local compute requirements.
Chinese AI lab Z.ai released GLM-5.2, a 753B-parameter open-weights LLM with a 1M-token context window under the MIT license. It achieves top scores on the Artificial Analysis Intelligence Index and ranks second on the Code Arena WebDev leaderboard.
Unsloth quantizations for the GLM 5.2 model are being released.
A user shares their Docker deployment configuration for running the GLM-5.2-FP8 model on HGX-H200 hardware using SGLang, achieving 262k context and 70 tokens/s.
Z.ai releases GLM-5.2, an open-weights AI model with improved coding and agentic performance, demonstrated by beating Kimi K2.7 Code on a physics simulation benchmark across three tasks.
GLM 5.2 has been released with open weights under the MIT license on Hugging Face and is available via API and Ollama. Its benchmarks are competitive, trailing Opus 4.8 by a point and edging GPT-5.5 by one.
GLM-5.2, an open source AI model from zai-org, is now available on HuggingChat.
Sentdex reports that GLM 5.2 from Z.ai is the first open model that can replace GPT-5.5 and Opus 4.8 across many tasks, with strong coding and agentic performance and a 1M context window.
GLM 5.2 is released as a 753B-parameter open-source model with a 1M context length under the MIT license. It scores 99.2 on AIME 2026, outperforming GPT-5.5, Gemini 3.1 Pro, and Claude Opus 4.8.
Z.AI releases GLM-5.2, a new flagship model with a solid 1M-token context, enhanced coding capabilities with flexible thinking effort, and improved architecture via IndexShare. It is released under an MIT open-source license.
Z.AI releases GLM-5.2, a flagship open-source model with a solid 1M-token context, improved coding capabilities, and a new IndexShare sparse attention architecture that reduces FLOPs by 2.9x at 1M context.
A user praises GLM 5.2 for being reliable and smart, but points out that a lack of compute power leads to instability.
Zhipu released GLM 5.2, an open-source model focused on coding that supports a 1M context. Tests show it approaches the level of Claude Opus 4.8 on large engineering and coding tasks, but it lacks multimodal capabilities and is limited by available compute, which makes it slower. The article also mentions Anthropic shutting down Fable 5 and Mythos 5 at the request of the U.S. Department of Commerce, highlighting the contrast between open-source and closed AI.
Zhipu AI announces GLM-5.2, its most capable open-source model with a 1M context window, positioning it as a foundation for complex agent applications and coding models. It is available immediately to GLM Coding Plan users, with API access following next week.