Unsloth Minimax M3 GGUF
Summary
Unsloth is uploading a GGUF quantized version of the MiniMax M3 model to Hugging Face.
Similar Articles
unsloth/Qwen3.6-27B-GGUF
Unsloth releases a GGUF quantized version of the Qwen3.6-27B model, featuring improved agentic coding capabilities, tool calling, and support for Unsloth Studio.
unsloth/gemma-4-26B-A4B-it-GGUF
Unsloth releases GGUF-quantized versions of Google DeepMind's Gemma 4 26B A4B instruction-tuned model, enabling efficient local inference with support for tool-calling and fine-tuning via Unsloth Studio. Gemma 4 is a multimodal MoE model with a 256K context window, supporting text, image, video, and audio inputs.
unsloth/Kimi-K2.6-GGUF
Unsloth releases quantized GGUF versions of the open-source 1T-parameter Kimi K2.6 MoE model, optimized for long-horizon coding, autonomous agent swarms, and production-ready design tasks.
unsloth/Qwen3.6-27B-MTP-GGUF
Unsloth has released GGUF weights for the Qwen3.6-27B model, featuring Multi-Token Prediction (MTP) for faster generation and enhanced agentic coding capabilities.
unsloth/North-Mini-Code-1.0-GGUF · Hugging Face
This page hosts GGUF quantized versions of Cohere's North-Mini-Code-1.0 model, a 30B-A3B MoE model optimized for code generation and agentic tasks. Instructions are provided for building llama.cpp from a specific PR to support the cohere2moe architecture.