GLM-5.2 is the new leading open weights model on Artificial Analysis

Hacker News Top Models

Summary

Z ai's GLM-5.2 has become the new leading open weights model on the Artificial Analysis Intelligence Index, scoring 51 and outperforming competitors like MiniMax-M3 and DeepSeek V4 Pro. The model features 744B total parameters, 40B active, MIT license, and 1M context window.

No content available
Original Article
View Cached Full Text

Cached at: 06/17/26, 11:40 AM

# GLM-5.2 is the new leading open weights model on the Artificial Analysis Intelligence Index Source: [https://artificialanalysis.ai/articles/glm-5-2-is-the-new-leading-open-weights-model-on-the-artificial-analysis-intelligence-index](https://artificialanalysis.ai/articles/glm-5-2-is-the-new-leading-open-weights-model-on-the-artificial-analysis-intelligence-index) **Z ai’s GLM\-5\.2 is the new leading open weights model on the Artificial Analysis Intelligence Index scoring 51 and it sits on the Pareto frontier of Intelligence vs Cost per Task** GLM\-5\.2 is the same size as GLM\-5\.1 \(744B total / 40B active parameters\) but scores 11 points higher on the Intelligence Index v4\.1, placing ahead of MiniMax\-M3 \(44\) and DeepSeek V4 Pro \(max, 44\)\. On the first\-party API it is priced in line with GLM\-5\.1 at $1\.4/$4\.4/$0\.26 per 1M input/output/cache hit tokens **Key results:** ➤**GLM\-5\.2 is the leading open weights model on the Intelligence Index v4\.1\.**At 51, it leads MiniMax\-M3 \(44\), DeepSeek V4 Pro \(max, 44\) and Kimi K2\.6 \(43\) ➤**Improvements across most evaluations, particularly scientific reasoning:**GLM\-5\.2 gains over GLM\-5\.1 on most evaluations, led by scientific reasoning on CritPt \(\+16 points to 21%\) and HLE \(\+12 points to 40%\), alongside AA\-LCR \(\+9 points to 71%\), tau3 banking \(\+15 points to 27%\) and SciCode \(\+7 points to 50%\)\. TerminalBench v2\.1 also improves \(\+16 points to 78%\) and GPQA Diamond gains 3 points to 89% **➤ Leading open weights model on GDPval\-AA v2 and competitive with proprietary models:**GLM\-5\.2 scores 1524 on GDPval\-AA v2, ahead of MiniMax\-M3 \(1418\) and DeepSeek V4 Pro \(max, 1328\)\. This impressive result places GLM\-5\.2 in\-line with proprietary models including GPT\-5\.5 \(xhigh reasoning\)\. GDPval\-AA v2 builds on the original GDPval\-AA by baselining Elo to human performance at 1000, introducing a rotating panel of frontier\-model judges, and raising the turn limit from 100 to 250 for longer\-horizon agent trajectories ➤**GLM\-5\.2 uses more output tokens per task than other leading open weights models:**the model uses 43k output tokens per Intelligence Index task, up from GLM\-5\.1 \(26k\) and above MiniMax\-M3 \(24k\), Kimi K2\.6 \(35k\) and DeepSeek V4 Pro \(max, 37k\) ➤**On the Intelligence vs\. Cost per Task Pareto Frontier:**GLM\-5\.2 is on the Pareto frontier of the Intelligence vs Cost per Task chart, with the lowest cost per task among models at its intelligence level\. GLM\-5\.2 costs ~$0\.46 per task, compared to GLM\-5\.1 \($0\.25\), Kimi K2\.6 \($0\.31\), MiniMax\-M3 \($0\.18\) and DeepSeek V4 Pro \(max, $0\.05\) **Additional Model Details:** ➤**License:**MIT ➤**Size:**744B total parameters, 40B active parameters, equivalent to GLM\-5\.1 ➤**Context window:**1M tokens, up from 200K on GLM\-5\.1 ➤**Pricing:**$1\.4/$0\.26/$4\.4 per 1M input/cache hit/output tokens ➤**Availability:**Alongside Z ai's first\-party API, GLM\-5\.2 is available across third\-party providers including DeepInfra, Novita, Nebius, Parasail, Siliconflow, GMI Cloud, Baseten, and Fireworks ![](https://cdn.sanity.io/images/6vfeftx9/articles/02808927b38ad45932bb0409bc1e723380fe3ce1-4640x4304.png?w=1200&auto=format) GLM\-5\.2 leads all open weights models on GDPval\-AA v2, our primary metric for real\-world agentic performance\. At 1524 it places ahead of MiniMax\-M3 \(1418\) and DeepSeek V4 Pro \(max, 1328\), and is effectively level with GPT\-5\.5 \(xhigh, 1514\)\. We visually inspected GLM\-5\.2's outputs across a range of GDPval\-AA tasks\. We have attached a selection below\. ![](https://cdn.sanity.io/images/6vfeftx9/articles/56965667c984b44ae4188c53f6eac28d2f52ed9b-4512x1968.png?w=1200&auto=format)![](https://cdn.sanity.io/images/6vfeftx9/articles/4ed3d217ccb7536bcba20755b335e3ad76852711-2500x3596.png?w=1200&auto=format)![](https://cdn.sanity.io/images/6vfeftx9/articles/58a359dae42b53499f2c684ede9be416c6da5260-2462x1384.png?w=1200&auto=format)![](https://cdn.sanity.io/images/6vfeftx9/articles/b48eb27f3ab92006247d42b236a833d2704adc66-2210x1384.png?w=1200&auto=format) GLM\-5\.2 scores 4 on the AA\-Omniscience Index, up from GLM\-5\.1 \(2\)\. The gain comes from both higher accuracy \(25\.1% vs 24\.2%\) and a lower hallucination rate \(28\.1% vs 29\.4%\), with attempt rate flat at 47%\. ![](https://cdn.sanity.io/images/6vfeftx9/articles/4635327905c84aee5a7f3b60b377badab1c2970d-4512x5984.png?w=1200&auto=format) GLM\-5\.2 uses 43k output tokens per Intelligence Index task, of which 37k is reasoning\. This is up from GLM\-5\.1 \(26k\) and higher than open weights peers MiniMax\-M3 \(24k\) and Kimi K2\.6 \(35k\), placing it among the less token\-efficient open weights models at its intelligence level\. GLM\-5\.2 sits off the most attractive quadrant on the Intelligence vs Output Tokens chart\. ![](https://cdn.sanity.io/images/6vfeftx9/articles/300314a95998b87fafd9d0643485d6d15df88920-4640x4288.png?w=1200&auto=format) Breakdown of the individual evaluations in the Artificial Analysis Intelligence Index v4\.1\. ![](https://cdn.sanity.io/images/6vfeftx9/articles/1cba772f84ae07633a1d347218cd619822f8d99a-4512x6776.png?w=1200&auto=format) Compare GLM\-5\.2 with other leading models at:[https://artificialanalysis\.ai/models/glm\-5\-2](https://artificialanalysis.ai/models/glm-5-2)

Similar Articles

GLM-5.2 is probably the most powerful text-only open weights LLM

Simon Willison's Blog

Chinese AI lab Z.ai released GLM-5.2, a 753B parameter open weights LLM with a 1M token context window under MIT license, achieving top scores on the Artificial Analysis Intelligence Index and ranking second on the Code Arena WebDev leaderboard.

GLM-5.2 is a win for local AI

Reddit r/LocalLLaMA

GLM-5.2, a 753B parameter open-source model with MIT license, offers frontier-level coding capabilities and massive context window. Its distillation potential promises significant improvements for local AI setups.