@Thom_Wolf: To all the newcomers excited to try Opus 4.8-level models at home: welcome to OpenWeightLand! Things work a little diff…
Summary
GLM-5.2 is an open-source long-context model with a solid 1M-token context, strong coding capabilities, and an MIT license, now available on Hugging Face.
View Cached Full Text
Cached at: 06/20/26, 08:22 PM
To all the newcomers excited to try Opus 4.8-level models at home: welcome to OpenWeightLand!
Things work a little differently here than in ClosedSourcistan. Might seem strange at first but you’ll quickly get used to it:
- there are many providers for the same model and they compete on price and features.
- as a result intelligence is abundant and typically much cheaper
- you can run the model on-prem, in your region, locally, or with the provider of your choice
- you can fine-tune it, modify it, and build businesses on top of it without asking anyone for permission
Turns out open weights create markets, not kingdoms.
A good central train station to start exploring is the Hugging Face page for GLM-5.2 under “Use this model”: -> https://huggingface.co/zai-org/GLM-5.2
And if you just want to chat with it, it’s free on HuggingChat: -> https://huggingface.co/chat/
zai-org/GLM-5.2 · Hugging Face
Source: https://huggingface.co/zai-org/GLM-5.2
👋 Join ourWeChatorDiscordcommunity. 📖 Check out the GLM-5.2blogand GLM-5Technical report. 📍 Use GLM-5.2 API services onZ.ai API Platform. 🔜 Try GLM-5.2here.
https://huggingface.co/zai-org/GLM-5.2#introductionIntroduction
We’re introducing GLM-5.2, our latest flagship model for long-horizon tasks. It marks a substantial leap in long-horizon task capability over its predecessor GLM-5.1 and, for the first time, delivers that capability on asolid 1M-token context. GLM-5.2’s new capabilities include:
- **Solid 1M Context:**A solid 1M-token context that stably sustains long-horizon work
- Advanced Coding with Flexible Effort: Stronger coding capabilities with multiple thinking effort levels to balance performance and latency
- Improved Architecture: We proposeIndexShare, which reuses the same indexer across every four sparse attention layers, reducing per-token FLOPs by 2.9× at a 1M context length. We also improve GLM-5.2’s MTP layer for speculative decoding, increasing the acceptance length by up to 20%
- Pure Open: An MIT open-source license — no regional limits, technical access without borders
https://huggingface.co/zai-org/GLM-5.2#benchmarkBenchmark
BenchmarkGLM-5.2GLM-5.1Qwen3.7-MaxMiniMax M3DeepSeek-V4-ProClaude Opus 4.8GPT-5.5Gemini 3.1 ProReasoningHLE40.53141.43737.749.8*41.4*45HLE (w/ Tools)54.752.353.5-48.257.9*52.2*51.4*CritPt20.94.613.43.712.920.927.117.7AIME 202699.295.397-94.695.798.398.2HMMT Nov. 202594.4949584.494.496.596.594.8HMMT Feb. 202692.582.697.184.495.296.796.787.3IMOAnswerBench91.083.890-89.883.5-81GPQA-Diamond91.286.2909390.193.693.694.3CodingSWE-bench Pro62.158.460.65955.469.258.654.2NL2Repo48.942.747.242.135.569.750.733.4DeepSWE46.21818208587010ProgramBench63.750.9--47.871.970.839.5Terminal Bench 2.1 (Terminus-2)81.063.5756564858474Terminal Bench 2.1 (Best Reported Harness)82.769---78.983.470.7FrontierSWE (Dominance)74.430.5--29.075.172.639.6PostTrainBench34.320.1---37.228.421.6SWE-Marathon13.01.0---26.012.04.0AgenticMCP-Atlas (Public Set)76.871.876.474.273.677.875.369.2Tool-Decathlon48.240.7--52.859.955.648.8
https://huggingface.co/zai-org/GLM-5.2#serve-glm-52-locallyServe GLM-5.2 Locally
GLM-5.2 supports deployment with the following frameworks. Feel free to try them out:
- SGLang(v0.5.13.post1+) — seecookbook
- vLLM(v0.23.0+) — seerecipes
- Transformers(v0.5.12+) — seetransformers docs
- KTransformers(v0.5.12+) — seetutorial
- Unsloth(v0.1.47-beta+) — seeguide
- For deployment on the
Ascend NPUplatform, inference frameworks such as vLLM-Ascend, xLLM and SGLang are supported — seehere.
https://huggingface.co/zai-org/GLM-5.2#citationCitation
If you find GLM-5.2 useful in your research, please cite our technical report:
@misc{glm5team2026glm5vibecodingagentic,
title={GLM-5: from Vibe Coding to Agentic Engineering},
author={GLM-5-Team and : and Aohan Zeng and Xin Lv and Zhenyu Hou and Zhengxiao Du and Qinkai Zheng and Bin Chen and Da Yin and Chendi Ge and Chenghua Huang and Chengxing Xie and Chenzheng Zhu and Congfeng Yin and Cunxiang Wang and Gengzheng Pan and Hao Zeng and Haoke Zhang and Haoran Wang and Huilong Chen and Jiajie Zhang and Jian Jiao and Jiaqi Guo and Jingsen Wang and Jingzhao Du and Jinzhu Wu and Kedong Wang and Lei Li and Lin Fan and Lucen Zhong and Mingdao Liu and Mingming Zhao and Pengfan Du and Qian Dong and Rui Lu and Shuang-Li and Shulin Cao and Song Liu and Ting Jiang and Xiaodong Chen and Xiaohan Zhang and Xuancheng Huang and Xuezhen Dong and Yabo Xu and Yao Wei and Yifan An and Yilin Niu and Yitong Zhu and Yuanhao Wen and Yukuo Cen and Yushi Bai and Zhongpei Qiao and Zihan Wang and Zikang Wang and Zilin Zhu and Ziqiang Liu and Zixuan Li and Bojie Wang and Bosi Wen and Can Huang and Changpeng Cai and Chao Yu and Chen Li and Chengwei Hu and Chenhui Zhang and Dan Zhang and Daoyan Lin and Dayong Yang and Di Wang and Ding Ai and Erle Zhu and Fangzhou Yi and Feiyu Chen and Guohong Wen and Hailong Sun and Haisha Zhao and Haiyi Hu and Hanchen Zhang and Hanrui Liu and Hanyu Zhang and Hao Peng and Hao Tai and Haobo Zhang and He Liu and Hongwei Wang and Hongxi Yan and Hongyu Ge and Huan Liu and Huanpeng Chu and Jia'ni Zhao and Jiachen Wang and Jiajing Zhao and Jiamin Ren and Jiapeng Wang and Jiaxin Zhang and Jiayi Gui and Jiayue Zhao and Jijie Li and Jing An and Jing Li and Jingwei Yuan and Jinhua Du and Jinxin Liu and Junkai Zhi and Junwen Duan and Kaiyue Zhou and Kangjian Wei and Ke Wang and Keyun Luo and Laiqiang Zhang and Leigang Sha and Liang Xu and Lindong Wu and Lintao Ding and Lu Chen and Minghao Li and Nianyi Lin and Pan Ta and Qiang Zou and Rongjun Song and Ruiqi Yang and Shangqing Tu and Shangtong Yang and Shaoxiang Wu and Shengyan Zhang and Shijie Li and Shuang Li and Shuyi Fan and Wei Qin and Wei Tian and Weining Zhang and Wenbo Yu and Wenjie Liang and Xiang Kuang and Xiangmeng Cheng and Xiangyang Li and Xiaoquan Yan and Xiaowei Hu and Xiaoying Ling and Xing Fan and Xingye Xia and Xinyuan Zhang and Xinze Zhang and Xirui Pan and Xu Zou and Xunkai Zhang and Yadi Liu and Yandong Wu and Yanfu Li and Yidong Wang and Yifan Zhu and Yijun Tan and Yilin Zhou and Yiming Pan and Ying Zhang and Yinpei Su and Yipeng Geng and Yong Yan and Yonglin Tan and Yuean Bi and Yuhan Shen and Yuhao Yang and Yujiang Li and Yunan Liu and Yunqing Wang and Yuntao Li and Yurong Wu and Yutao Zhang and Yuxi Duan and Yuxuan Zhang and Zezhen Liu and Zhengtao Jiang and Zhenhe Yan and Zheyu Zhang and Zhixiang Wei and Zhuo Chen and Zhuoer Feng and Zijun Yao and Ziwei Chai and Ziyuan Wang and Zuzhou Zhang and Bin Xu and Minlie Huang and Hongning Wang and Juanzi Li and Yuxiao Dong and Jie Tang},
year={2026},
eprint={2602.15763},
archivePrefix={arXiv},
primaryClass={cs.LG},
url={https://arxiv.org/abs/2602.15763},
}
Similar Articles
@haider1: GLM 5.2 feels like the opus 4.5 moment for open-weight models what genuinely impressed me was during long, multi-step a…
GLM 5.2 marks a significant milestone for open-weight models, demonstrating strong context retention across long multi-step tasks and more reliable tool calling.
GLM 5.2 API is live, weights are on HF, and ollama has it already
GLM 5.2 has been released with open weights under MIT license on HuggingFace, available via API and Ollama, featuring competitive benchmarks that trail Opus 4.8 by a point and edge GPT-5.5 by one.
@mervenoyann: GLM-5.2 is comparable to Opus 4.8 with 1M context > new IS attention reuses one indexer every 4 sparse layers (2.9× per…
GLM-5.2 is a new model comparable to Opus 4.8, featuring 1M context, new IS attention, improved speculative decoding, and flexible thinking-effort levels. It is released under MIT license with day-0 support in transformers, vLLM, and SGLang.
GLM-5.2 just dropped open weights and it already looks weirdly strong for coding
GLM-5.2 has been released with open weights under MIT license, featuring a 1M context window and two reasoning effort modes. Early benchmarks show it performing strongly in coding tasks, making it worth testing beyond benchmark screenshots.
GLM 5.2 vs. Opus
GLM 5.2 is a new open-weights model from Z.ai, compared against Claude Opus in a 3D game coding task. Opus performed faster and cleaner, but GLM 5.2 offers compelling cost and accessibility advantages.
