unsloth/MiniMax-M3-GGUF

Hugging Face Models Trending 2026/06/12 05:56 模型

gguf multimodal image-text-to-text quantization unsloth minimax-m3 open-source

摘要

Unsloth 发布了 MiniMax-M3 多模态模型的 GGUF 量化版本，支持图像-文本到文本任务，兼容 Transformers、llama.cpp、vLLM 等推理引擎。

任务：图像-文本到文本标签：transformers, gguf, 多模态, MoE, 智能体, 编码, 视频, minimax_m3_vl, 图像-文本到文本, base_model:MiniMaxAI/MiniMax-M3, base_model:quantized:MiniMaxAI/MiniMax-M3, 许可证：其他, 端点兼容, 区域：美国, 对话式

查看原文

查看缓存全文

缓存时间: 2026/06/16 03:00

unsloth/MiniMax-M3-GGUF · Hugging Face 来源：https://huggingface.co/unsloth/MiniMax-M3-GGUF 库Transformers (https://huggingface.co/unsloth/MiniMax-M3-GGUF?library=transformers)如何使用 unsloth/MiniMax-M3-GGUF 搭配 Transformers： `# 使用 pipeline 作为高级助手 from transformers import pipeline pipe = pipeline("image-text-to-text", model="unsloth/MiniMax-M3-GGUF") messages = [ { "role": "user", "content": [ {"type": "image", "url": "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/p-blog/candy.JPG"}, {"type": "text", "text": "糖上是什么动物？"} ] }, ] pipe(text=messages)` `# 直接加载模型 from transformers import AutoModel model = AutoModel.from_pretrained("unsloth/MiniMax-M3-GGUF", dtype="auto")` llama-cpp-python (https://huggingface.co/unsloth/MiniMax-M3-GGUF?library=llama-cpp-python)如何使用 unsloth/MiniMax-M3-GGUF 搭配 llama-cpp-python： `# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="unsloth/MiniMax-M3-GGUF", filename="BF16/MiniMax-M3-BF16-00001-of-00018.gguf", )` `llm.create_chat_completion( messages = [ { "role": "user", "content": [ { "type": "text", "text": "用一句话描述这张图片。" }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] )` 笔记本Google Colab (https://huggingface.co/unsloth/MiniMax-M3-GGUF/colab)Kaggle (https://huggingface.co/unsloth/MiniMax-M3-GGUF/kaggle)本地应用设置 (https://huggingface.co/settings/local-apps)llama.cpp (https://huggingface.co/unsloth/MiniMax-M3-GGUF?local-app=llama.cpp)如何使用 unsloth/MiniMax-M3-GGUF 搭配 llama.cpp： ##### 通过 brew 安装 `brew install llama.cpp # 启动本地 OpenAI 兼容服务器（含 Web UI）： llama-server -hf unsloth/MiniMax-M3-GGUF:UD-Q4_K_M # 在终端直接运行推理： llama-cli -hf unsloth/MiniMax-M3-GGUF:UD-Q4_K_M` ##### 通过 WinGet 安装（Windows） `winget install llama.cpp # 启动本地 OpenAI 兼容服务器（含 Web UI）： llama-server -hf unsloth/MiniMax-M3-GGUF:UD-Q4_K_M # 在终端直接运行推理： llama-cli -hf unsloth/MiniMax-M3-GGUF:UD-Q4_K_M` ##### 使用预编译二进制文件 `# 从以下地址下载预编译二进制文件： # https://github.com/ggerganov/llama.cpp/releases # 启动本地 OpenAI 兼容服务器（含 Web UI）： ./llama-server -hf unsloth/MiniMax-M3-GGUF:UD-Q4_K_M # 在终端直接运行推理： ./llama-cli -hf unsloth/MiniMax-M3-GGUF:UD-Q4_K_M` ##### 从源码构建 `git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # 启动本地 OpenAI 兼容服务器（含 Web UI）： ./build/bin/llama-server -hf unsloth/MiniMax-M3-GGUF:UD-Q4_K_M # 在终端直接运行推理： ./build/bin/llama-cli -hf unsloth/MiniMax-M3-GGUF:UD-Q4_K_M` ##### 使用 Docker `docker model run hf.co/unsloth/MiniMax-M3-GGUF:UD-Q4_K_M` LM Studio JanvLLM (https://huggingface.co/unsloth/MiniMax-M3-GGUF?local-app=vllm)如何使用 unsloth/MiniMax-M3-GGUF 搭配 vLLM： ##### 通过 pip 安装并服务模型 # 通过 pip 安装 vLLM： pip install vllm # 启动 vLLM 服务器： vllm serve "unsloth/MiniMax-M3-GGUF" # 使用 curl 调用服务器（OpenAI 兼容 API）： curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "unsloth/MiniMax-M3-GGUF", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "用一句话描述这张图片。" }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }' ##### 使用 Docker `docker model run hf.co/unsloth/MiniMax-M3-GGUF:UD-Q4_K_M` SGLang (https://huggingface.co/unsloth/MiniMax-M3-GGUF?local-app=sglang)如何使用 unsloth/MiniMax-M3-GGUF 搭配 SGLang： ##### 通过 pip 安装并服务模型 # 通过 pip 安装 SGLang： pip install sglang # 启动 SGLang 服务器： python3 -m sglang.launch_server \ --model-path "unsloth/MiniMax-M3-GGUF" \ --host 0.0.0.0 \ --port 30000 # 使用 curl 调用服务器（OpenAI 兼容 API）： curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "unsloth/MiniMax-M3-GGUF", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "用一句话描述这张图片。" }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }' ##### 使用 Docker 镜像 docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "unsloth/MiniMax-M3-GGUF" \ --host 0.0.0.0 \ --port 30000 # 使用 curl 调用服务器（OpenAI 兼容 API）： curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "unsloth/MiniMax-M3-GGUF", "messages": [ { "role": "user", "content": [ { "type": "text", "text": "用一句话描述这张图片。" }, { "type": "image_url", "image_url": { "url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" } } ] } ] }' Ollama (https://huggingface.co/unsloth/MiniMax-M3-GGUF?local-app=ollama)如何使用 unsloth/MiniMax-M3-GGUF 搭配 Ollama： `ollama run hf.co/unsloth/MiniMax-M3-GGUF:UD-Q4_K_M` Unsloth Studio (https://huggingface.co/unsloth/MiniMax-M3-GGUF?local-app=unsloth)如何使用 unsloth/MiniMax-M3-GGUF 搭配 Unsloth Studio： ##### 安装 Unsloth Studio（macOS、Linux、WSL） `curl -fsSL https://unsloth.ai/install.sh | sh # 运行 unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # 然后在浏览器中打开 http://localhost:8888 # 搜索 unsloth/MiniMax-M3-GGUF 开始聊天` ##### 安装 Unsloth Studio（Windows） `irm https://unsloth.ai/install.ps1 | iex # 运行 unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # 然后在浏览器中打开 http://localhost:8888 # 搜索 unsloth/MiniMax-M3-GGUF 开始聊天` ##### 使用 HuggingFace Spaces 运行 Unsloth `# 无需设置 # 在浏览器中打开 https://huggingface.co/spaces/unsloth/studio # 搜索 unsloth/MiniMax-M3-GGUF 开始聊天` Pi (https://huggingface.co/unsloth/MiniMax-M3-GGUF?local-app=pi)如何使用 unsloth/MiniMax-M3-GGUF 搭配 Pi： ##### 启动 llama.cpp 服务器 `# 安装 llama.cpp： brew install llama.cpp # 启动本地 OpenAI 兼容服务器： llama-server -hf unsloth/MiniMax-M3-GGUF:UD-Q4_K_M` ##### 在 Pi 中配置模型 `# 安装 Pi： npm install -g @mariozechner/pi-coding-agent # 添加到 ~/.pi/agent/models.json： { "providers": { "llama-cpp": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "unsloth/MiniMax-M3-GGUF:UD-Q4_K_M" } ] } } }` ##### 运行 Pi `# 在项目目录中启动 Pi： pi` Hermes Agent新 (https://huggingface.co/unsloth/MiniMax-M3-GGUF?local-app=hermes-agent)如何使用 unsloth/MiniMax-M3-GGUF 搭配 Hermes Agent： ##### 启动 llama.cpp 服务器 `# 安装 llama.cpp： brew install llama.cpp # 启动本地 OpenAI 兼容服务器： llama-server -hf unsloth/MiniMax-M3-GGUF:UD-Q4_K_M` ##### 配置 Hermes `# 安装 Hermes： curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # 将 Hermes 指向本地服务器： hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default unsloth/MiniMax-M3-GGUF:UD-Q4_K_M` ##### 运行 Hermes `hermes` Atomic Chat新Docker Model Runner (https://huggingface.co/unsloth/MiniMax-M3-GGUF?local-app=docker-model-runner)如何使用 unsloth/MiniMax-M3-GGUF 搭配 Docker Model Runner： `docker model run hf.co/unsloth/MiniMax-M3-GGUF:UD-Q4_K_M` Lemonade (https://huggingface.co/unsloth/MiniMax-M3-GGUF?local-app=lemonade)如何使用 unsloth/MiniMax-M3-GGUF 搭配 Lemonade： ##### 拉取模型 `# 从 https://lemonade-server.ai/ 下载 Lemonade lemonade pull unsloth/MiniMax-M3-GGUF:UD-Q4_K_M` ##### 运行并聊天 `lemonade run user.MiniMax-M3-GGUF-UD-Q4_K_M` ##### 列出所有可用模型 `lemonade list`

相似文章

Unsloth MiniMax M3 GGUF

Reddit r/LocalLLaMA

Unsloth 正在将 MiniMax M3 模型的 GGUF 量化版本上传到 Hugging Face。

unsloth/gemma-4-26B-A4B-it-GGUF

Hugging Face Models Trending

# unsloth/gemma-4-26B-A4B-it-GGUF · Hugging Face 来源：[https://huggingface.co/unsloth/gemma-4-26B-A4B-it-GGUF](https://huggingface.co/unsloth/gemma-4-26B-A4B-it-GGUF) ## [https://huggingface.co/unsloth/gemma-4-26B-A4B-it-GGUF#read-our-how-to-run-gemma-4-guide](https://huggingface.co/unsloth/gemma-4-26B-A4B-it-GGUF#read-our-how-to-run-gemma-4-guide)阅读我们的[如何运行 Gemma 4 指南](https://docs.unsloth.ai/models/gemma-4)！ *请参阅[Unsloth Dynamic 2.0 GGUFs](https://unsloth.ai/docs/basics/unslot

unsloth/MiniMax-M3-GGUF

相似文章

Unsloth MiniMax M3 GGUF

unsloth/gemma-4-26B-A4B-it-GGUF

unsloth/ERNIE-Image-Turbo-GGUF

unsloth/MiMo-V2.5-GGUF · Hugging Face

unsloth/Kimi-K2.6-GGUF

提交意见反馈