Unsloth MiniMax M3 GGUF

Reddit r/LocalLLaMA Models

Summary

Unsloth is uploading a GGUF quantized version of the MiniMax M3 model to Hugging Face.

The upload is still in progress: [https://huggingface.co/unsloth/MiniMax-M3-GGUF](https://huggingface.co/unsloth/MiniMax-M3-GGUF)
