GLM-5.2 can now run locally in llama.cpp and Unsloth Studio.
Summary
GLM-5.2 is now supported for local execution via llama.cpp and Unsloth Studio.
Similar Articles
PSA: unsloth/GLM-5.2-GGUF is uploading
unsloth has uploaded a GGUF version of GLM-5.2 to Hugging Face, providing ready-to-use model files for various inference engines like llama.cpp, vLLM, and SGLang.
GLM-5.2 is a win for local AI
GLM-5.2, a 753B parameter open-source model with MIT license, offers frontier-level coding capabilities and massive context window. Its distillation potential promises significant improvements for local AI setups.
Giving GLM-5.2 a spin locally on CPU only! (poor man's rig for big models)
A user runs GLM-5.2 locally on CPU only, demonstrating how to run a large model on a modest setup.
@pcuenq: GLM 5.2 has just been released Here it's already running with MLX on two Mac Studios (M3 Ultra). This is comparable to …
GLM 5.2, an open-weight AI model comparable to top closed models, has been released and is now running on MLX on two Mac Studios (M3 Ultra).
ggml-org/llama.cpp
llama.cpp is an open-source C/C++ library for efficient LLM inference on local hardware, supporting various quantization methods and multiple backends (CPU, GPU, etc.).