Human Evaluation of GLM-5.2
Summary
The author praises GLM-5.2, an MIT open-weights model, for its exceptional real-world performance in human evaluation benchmarks, claiming it rivals the best closed-source models like those from Claude.
Similar Articles
GLM-5.2 is the new leading open weights model on Artificial Analysis
Z ai's GLM-5.2 has become the new leading open weights model on the Artificial Analysis Intelligence Index, scoring 51 and outperforming competitors like MiniMax-M3 and DeepSeek V4 Pro. The model features 744B total parameters, 40B active, MIT license, and 1M context window.
GLM-5.2 is the first open-weights model to cross 80% on Terminal-Bench and beats every other open model available
GLM-5.2 is the first open-weights model to exceed 80% on Terminal-Bench, surpassing all other open models and even Gemini, making it a frontier-level model at a fraction of the cost.
GLM-5.2 just dropped open weights and it already looks weirdly strong for coding
GLM-5.2 has been released with open weights under MIT license, featuring a 1M context window and two reasoning effort modes. Early benchmarks show it performing strongly in coding tasks, making it worth testing beyond benchmark screenshots.
@haider1: GLM 5.2 feels like the opus 4.5 moment for open-weight models what genuinely impressed me was during long, multi-step a…
GLM 5.2 marks a significant milestone for open-weight models, demonstrating strong context retention across long multi-step tasks and more reliable tool calling.
GLM-5.2 Raises the Bar for Open Models (14 minute read)
GLM-5.2 is a new open-source AI model that sets a high bar for open models, though it still trails proprietary frontier models and lacks some features like vision.