tencent/Hy-MT2-7B

Hugging Face Models Trending 2026/05/11 14:56 模型

multilingual-translation open-source fast-thinking quantization benchmark tencent-hunyuan

摘要

腾讯开源了Hy-MT2系列快速思考的多语言翻译模型（1.8B、7B、30B-A3B），支持33种语言，同时提供了用于设备端部署的极端量化方法以及一个新的指令遵循基准IFMTBench。

任务：翻译标签：transformers, safetensors, hunyuan_v1_dense, text-generation, translation, zh, en, fr, pt, es, ja, tr, ru, ar, ko, th, it, de, vi, ms, id, tl, hi, pl, cs, nl, km, my, fa, gu, ur, te, mr, he, bn, ta, uk, bo, kk, mn, ug, arxiv:2605.22064, endpoints_compatible, region:us

查看原文

查看缓存全文

缓存时间: 2026/05/23 13:48

tencent/Hy-MT2-7B · Hugging Face

来源：https://huggingface.co/tencent/Hy-MT2-7B English |中文 (https://huggingface.co/tencent/README_CN.md)

HuggingFace (https://huggingface.co/collections/tencent/hy-mt2)ModelScope (https://modelscope.cn/collections/Tencent-Hunyuan/Hy-MT2)

🖥️官方网站 (https://aistudio.tencent.com/llm/en?tabIndex=0)| 💬GitHub (https://github.com/Tencent-Hunyuan/Hy-MT2)| 🪡AngelSlim (https://github.com/Tencent/AngelSlim/tree/main)| 📚Hy-MT2 报告 (https://arxiv.org/pdf/2605.22064)

https://huggingface.co/tencent/Hy-MT2-7B#model-introduction模型介绍

Hy-MT2 是一个“快思考”多语言翻译模型家族，专为复杂的真实世界场景设计。它包含三个模型大小：1.8B、7B 和 30B-A3B（MoE），所有模型均支持 33 种语言间的翻译，并能有效遵循多语言翻译指令。在端侧部署方面，AngelSlim 1.25-bit 极致量化将 1.8B 模型的存储需求降低至仅 440 MB，并提升 1.5 倍推理速度。多维度评估显示，Hy-MT2 在通用、真实业务、垂直领域以及指令遵循翻译任务中均表现出色。7B 和 30B-A3B 模型在快思考模式下超越了 DeepSeek-V4-Pro 和 Kimi K2.6 等开源模型，而轻量级的 1.8B 模型在整体性能上也超越了微软、豆包等主流商业 API。

在此次发布中，我们还开源了 IFMTBench (https://huggingface.co/tencent/Hy-MT2-7B/blob/main/IFMTBench/README.md)，一个用于评估翻译指令遵循能力的基准测试。

我们也欢迎大家使用我们发布的 Hy-MT2-Translator Skill，它可以方便地将 Hy-MT2 系列模型集成到翻译任务中。下载链接：ClawHub (https://clawhub.ai/tencent-adm/hy-mt2-translator-skill) 和 SkillHub (https://skillhub.cn/skills/hy-mt2-translator)。

现在，腾讯混元正式与 WMT26 合作举办“视频字幕翻译任务”(https://www2.statmt.org/w26/video-subtitle-translation.html)。在“通用机器翻译任务”(https://www2.statmt.org/wmt26/translation-task.html)和“视频字幕翻译任务”中使用 Hy-MT 模型系列参赛的选手将有机会赢取由混元赞助的特别奖项。我们诚挚邀请大家参与，共同推动机器翻译技术的边界！

https://huggingface.co/tencent/Hy-MT2-7B#news新闻

2026.5.21 我们在 HuggingFace 和 ModelScope 上开源了 Hy-MT2-1.8B/Hy-MT2-7B/Hy-MT2-30B-A3B/IFMTBench。
2025.12.30 我们在 HuggingFace 和 ModelScope 上开源了 HY-MT1.5-1.8B 和 HY-MT1.5-7B。
2025.9.1 我们在 HuggingFace 和 ModelScope 上开源了 Hunyuan-MT-7B 和 Hunyuan-MT-Chimera-7B。

https://huggingface.co/tencent/Hy-MT2-7B#results结果

更多实验结果和分析请参阅我们的报告 (https://arxiv.org/pdf/2605.22064)。

https://huggingface.co/tencent/Hy-MT2-7B#model-links模型链接

模型名称描述下载链接Hy-MT2-1.8B混元 1.8B 翻译模型🤗模型 (https://huggingface.co/tencent/Hy-MT2-1.8B)Hy-MT2-1.8B-FP8混元 1.8B 翻译模型，FP8 量化🤗模型 (https://huggingface.co/tencent/Hy-MT2-1.8B-FP8)Hy-MT2-1.8B-GGUF混元 1.8B 翻译模型，llama.cpp🤗模型 (https://huggingface.co/tencent/Hy-MT2-1.8B-GGUF)Hy-MT2-1.8B-2bit-GGUF混元 1.8B 翻译模型，llama.cpp，2bit🤗模型 (https://huggingface.co/tencent/Hy-MT2-1.8B-2bit-GGUF)Hy-MT2-1.8B-1.25bit-GGUF混元 1.8B 翻译模型，llama.cpp，1.25bit🤗模型 (https://huggingface.co/tencent/Hy-MT2-1.8B-1.25bit-GGUF)Hy-MT2-7B混元 7B 翻译模型🤗模型 (https://huggingface.co/tencent/Hy-MT2-7B)Hy-MT2-7B-FP8混元 7B 翻译模型，FP8 量化🤗模型 (https://huggingface.co/tencent/Hy-MT2-7B-FP8)Hy-MT2-7B-GGUF混元 7B 翻译模型，llama.cpp🤗模型 (https://huggingface.co/tencent/Hy-MT2-7B-GGUF)Hy-MT2-30B-A3B混元 30B-A3B 翻译模型🤗模型 (https://huggingface.co/tencent/Hy-MT2-30B-A3B)Hy-MT2-30B-A3B-FP8混元 30B-A3B 翻译模型，FP8 量化🤗模型 (https://huggingface.co/tencent/Hy-MT2-30B-A3B-FP8)

https://huggingface.co/tencent/Hy-MT2-7B#hy-mt2-translation-task-instruction-examples-chinese-english-comparisonHy-MT2 翻译任务指令示例（中英文对照）

注意：以下示例中，source_lang 和 target_lang 均使用完整的语言名称。中文提示中使用中文名称，英文提示中使用英文名称。

类型中文提示英文提示默认翻译将以下文本翻译为 {target_lang}，注意只需要输出翻译后的结果，不要额外解释：{source_text}

Translate the following text into {target_lang}. Note that you shouldonly output the translated result without any additional explanation: {source_text}

术语 参考下面的翻译： {text}翻译成{text} {text}翻译成{text} {text}翻译成{text} 将以下文本翻译为 {target_lang}，注意只需要输出翻译后的结果，不要额外解释：{source_text}

Reference the following translations: {text}translates to{text} {text}translates to{text} {text}translates to{text}Translate the following text into {target_lang}. Note that you mustONLY output the translated result without any additional explanation:

{source_text}

风格请将以下文本翻译为 {target_lang}。注意翻译的风格要严格符合【{target_style}】{source_text}

Please translate the following text into {target_lang}. Note that the translation style must strictly conform to [{target_style}]: {source_text}

个性化 【待翻译文本】 {source_text}【翻译任务】 1、**{user_preferences}2、{user_preferences}**3、…… 4、将【待翻译文本】翻译为 {target_lang}。

[Source Text] {source_text}[Translation Tasks] 1. {user_preferences} 2. {user_preferences} 3. … 4. Translate the [Source Text] into {target_lang}.

分隔符 请将以下文本准确翻译为 {target_lang}。你必须在译文中保留等量的分隔符，绝对不可遗漏、转义或翻译该符号，并注意分隔符的位置。{source_text}

Please accurately translate the following text into {target_lang}. You mustretain the exact same number of delimiters in the translation. Strictly do not omit, escape, or translate these symbols, and pay close attention to their placement. {source_text}

结构化数据 1 # 任务目标 将下方 {source_text} 中的 {format_type} 格式数据翻译为 {target_lang}。# 严格约束 1. 结构锁定：绝对保持原有的 {format_type} 数据结构、缩进和层级完全不变。 2. 选择性翻译：仅翻译面向用户展示的可见文本内容。 3. 禁止修改：严禁翻译或更改任何代码标签、键名 (Key)、变量占位符（如 {{var}}、${var}、%s、%d 等）或代码属性。

# 数据输入 {source_text}

### Task Translate the user-facing text within the following {format_type} data into {target_lang}.### Strict Rules 1. Structure Preservation: You MUST preserve the original {format_type} data structure, nesting, hierarchy, and indentation exactly as they are. 2. Selective Translation: Translate ONLY the visible, user-facing text content/values. 3. Strict Non-Translation: NEVER translate or alter code tags, keys, properties, object names, or variable placeholders. Leave them exactly in their original English/code form.

### Source Data {source_text}

结构化数据 2 【背景信息】 {background_text}请结合背景信息将以下文本翻译为 {target_lang}。

【待翻译文本】 {source_text}

[Background Information] {background_text}Please translate the following text into {target_lang}, taking the provided background information into consideration.

[Source Text] {source_text}

https://huggingface.co/tencent/Hy-MT2-7B#inference-and-deployment推理与部署

对于 1.8B 和 7B，我们建议使用以下参数进行推理。注意我们的模型没有默认的 system_prompt。

{ "temperature": 0.7, "top_p": 0.6, "top_k": 20, "repetition_penalty": 1.05, "max_tokens": 4096 }

对于 30B-A3B，我们建议使用以下参数进行推理。注意我们的模型没有默认的 system_prompt。

{ "temperature": 0.7, "top_p": 1.0, "top_k": -1, "repetition_penalty": 1.0, "max_tokens": 4096 }

https://huggingface.co/tencent/Hy-MT2-7B#transformerstransformers

transformers >= 5.6.0

`` from transformers import AutoModelForCausalLM, AutoTokenizer import torch

model_path = “tencent/Hy-MT2-7B”

加载 tokenizer

tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

加载模型

model = AutoModelForCausalLM.from_pretrained( model_path, dtype=torch.bfloat16, device_map=“auto”, trust_remote_code=True, )

model.eval()

示例推理

prompt = “将以下文本翻译成英语,注意只需要输出翻译后的结果,不要额外解释:\n\n今天天气真好。” messages = [{“role”: “user”, “content”: prompt}] inputs = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors=“pt”).to(model.device)

with torch.no_grad(): outputs = model.generate( **inputs, max_new_tokens=4096, ) response = tokenizer.decode(outputs[0][inputs[“input_ids”].shape[-1]:], skip_special_tokens=True) print(response) ``

https://huggingface.co/tencent/Hy-MT2-7B#vllmvllm

从源码构建 vLLM：

uv venv --python 3.12 --seed --managed-python source .venv/bin/activate git clone https://github.com/vllm-project/vllm.git cd vllm uv pip install --editable . --torch-backend=auto

启动 vLLM 服务器：

vllm serve tencent/Hy-MT2-7B --tensor-parallel-size 1

https://huggingface.co/tencent/Hy-MT2-7B#sglangsglang

从源码构建 SGLang：

git clone https://github.com/sgl-project/sglang cd sglang pip3 install pip --upgrade pip3 install "transformers>=5.6.0" pip3 install -e "python"

启动 SGLang 服务器：

python3 -m sglang.launch_server --model tencent/Hy-MT2-7B --tp 1

https://huggingface.co/tencent/Hy-MT2-7B#llama_cppllama_cpp

❕❕ 此 gguf 依赖于我们的 STK 内核，该内核已在 PR #22836 (https://github.com/ggml-org/llama.cpp/pull/22836) 中发布。

https://huggingface.co/tencent/Hy-MT2-7B#clone-llamacpp克隆 llama.cpp

git clone https://github.com/ggml-org/llama.cpp.git

https://huggingface.co/tencent/Hy-MT2-7B#enter-the-llamacpp-folder进入 llama.cpp 文件夹

cd llama.cpp

https://huggingface.co/tencent/Hy-MT2-7B#build-llamacpp构建 llama.cpp

cmake -B build cmake --build build --config Release

https://huggingface.co/tencent/Hy-MT2-7B#run-a-completion-example运行 completion 示例

./build/bin/llama-completion \ --model model.gguf \ -p "Translate the following segment into Chinese, without additional explanation:Hello" \ --jinja \ -ngl 0 \ -n 64 -st

https://huggingface.co/tencent/Hy-MT2-7B#run-the-llamacpp-benchmark运行 llama.cpp 基准测试

./build/bin/llama-bench -m model_zoo/model.gguf -ngl 0

https://huggingface.co/tencent/Hy-MT2-7B#model-training模型训练

Hy-MT2 提供完整的模型训练流程，支持全参数微调和 LoRA 微调，以及多种 DeepSpeed ZeRO 配置和 LLaMA-Factory 集成。

更多训练文档请参阅：模型训练指南 (https://huggingface.co/tencent/Hy-MT2-7B/blob/main/train/README.md)

https://huggingface.co/tencent/Hy-MT2-7B#quantization-tool量化工具

我们提供了 AngelSlim (https://github.com/tencent/AngelSlim)，一个易于使用、全面且高效的大模型压缩工具包，涵盖常见量化算法、低比特量化、投机采样等。

https://huggingface.co/tencent/Hy-MT2-7B#supported-languages支持的语言

语言缩写中文名称Chinesezh中文Englishen英语Frenchfr法语Portuguesept葡萄牙语Spanishes西班牙语Japaneseja日语Turkishtr土耳其语Russianru俄语Arabicar阿拉伯语Koreanko韩语Thaith泰语Italianit意大利语Germande德语Vietnamesevi越南语Malayms马来语Indonesianid印尼语Filipinotl菲律宾语Hindihi印地语Traditional Chinesezh-Hant繁体中文Polishpl波兰语Czechcs捷克语Dutchnl荷兰语Khmerkm高棉语Burmesemy缅甸语Persianfa波斯语Gujaratigu古吉拉特语Urduur乌尔都语Telugute泰卢固语Marathimr马拉地语Hebrewhe希伯来语Bengalibn孟加拉语Tamilta泰米尔语Ukrainianuk乌克兰语Tibetanbo藏语Kazakhkk哈萨克语Mongolianmn蒙古语Uyghurug维吾尔语Cantoneseyue粤语

https://huggingface.co/tencent/Hy-MT2-7B#citing-hy-mt2引用 Hy-MT2

@misc{zheng2026hymt2familyfastefficient, title={Hy-MT2: A Family of Fast, Efficient and Powerful Multilingual Translation Models in the Wild}, author={Mao Zheng and Zheng Li and Tao Chen and Bo Lv and Mingrui Sun and Mingyang Song and Jinlong Song and Hong Huang and Decheng Wu and Hai Wang and Yifan Song and Yanfeng Chen and Guanwei Zhang}, year={2026}, eprint={2605.22064}, archivePrefix={arXiv}, primaryClass={cs.CL}, url={https://arxiv.org/abs/2605.22064}, }

https://huggingface.co/tencent/Hy-MT2-7B#contact-us联系我们

如果您想向我们的研发和产品团队提供反馈，欢迎联系腾讯混元大模型团队。您可以通过电子邮件 [email protected] 与我们联系。

相似文章

Hy-MT2：一套快速、高效且强大的多语言翻译模型，面向实际应用

arXiv cs.CL

Hy-MT2 是腾讯推出的一套快速高效的多语言翻译模型，提供 1.8B、7B 和 30B-A3B 三种尺寸，支持 33 种语言，性能超越此前开源和商业模型。

@FeitengLi: Hy-MT2 新的开源多语言翻译模型能力可以跟顶尖大模型匹配、支持 33 种语言之间的互译，而且具备灵活的 Instruct 能力，实现了 2-bit 量化不足 500MB 的空间占用可以很好的跑在端侧 https://modelsc…

X AI KOLs Timeline

Hy-MT2 is a new open-source multilingual translation model from Tencent Hy that supports 33 languages, offers flexible instruction capabilities, and achieves 2-bit quantization under 500MB for on-device deployment.

tencent/Hy-MT2-7B

tencent/Hy-MT2-7B · Hugging Face

https://huggingface.co/tencent/Hy-MT2-7B#model-introduction模型介绍

https://huggingface.co/tencent/Hy-MT2-7B#news新闻

https://huggingface.co/tencent/Hy-MT2-7B#results结果

https://huggingface.co/tencent/Hy-MT2-7B#model-links模型链接

https://huggingface.co/tencent/Hy-MT2-7B#hy-mt2-translation-task-instruction-examples-chinese-english-comparisonHy-MT2 翻译任务指令示例（中英文对照）

https://huggingface.co/tencent/Hy-MT2-7B#inference-and-deployment推理与部署

https://huggingface.co/tencent/Hy-MT2-7B#transformerstransformers

加载 tokenizer

加载模型

示例推理

https://huggingface.co/tencent/Hy-MT2-7B#vllmvllm

https://huggingface.co/tencent/Hy-MT2-7B#sglangsglang

https://huggingface.co/tencent/Hy-MT2-7B#llama_cppllama_cpp

https://huggingface.co/tencent/Hy-MT2-7B#clone-llamacpp克隆 llama.cpp

https://huggingface.co/tencent/Hy-MT2-7B#enter-the-llamacpp-folder进入 llama.cpp 文件夹

https://huggingface.co/tencent/Hy-MT2-7B#build-llamacpp构建 llama.cpp

https://huggingface.co/tencent/Hy-MT2-7B#run-a-completion-example运行 completion 示例

https://huggingface.co/tencent/Hy-MT2-7B#run-the-llamacpp-benchmark运行 llama.cpp 基准测试

https://huggingface.co/tencent/Hy-MT2-7B#model-training模型训练

https://huggingface.co/tencent/Hy-MT2-7B#quantization-tool量化工具

https://huggingface.co/tencent/Hy-MT2-7B#supported-languages支持的语言

https://huggingface.co/tencent/Hy-MT2-7B#citing-hy-mt2引用 Hy-MT2

https://huggingface.co/tencent/Hy-MT2-7B#contact-us联系我们

相似文章

Hy-MT2：一套快速、高效且强大的多语言翻译模型，面向实际应用

@FeitengLi: Hy-MT2 新的开源多语言翻译模型能力可以跟顶尖大模型匹配、支持 33 种语言之间的互译，而且具备灵活的 Instruct 能力，实现了 2-bit 量化不足 500MB 的空间占用可以很好的跑在端侧 https://modelsc…

AngelSlim/Hy-MT1.5-1.8B-1.25bit

Tencent Hy-MT2 现已采用 Apache License 2.0

@FeitengLi: 99M 参数的 TTS 跑在 CPU 上，比 2B 大模型跑在 A100 上还快。 Supertone 新开源的 supertonic-3 ONNX Runtime，完全本地，浏览器能跑，手机能跑，树莓派也能跑。

提交意见反馈

tencent/Hy-MT2-7B · Hugging Face

https://huggingface.co/tencent/Hy-MT2-7B#model-introduction模型介绍

https://huggingface.co/tencent/Hy-MT2-7B#news新闻

https://huggingface.co/tencent/Hy-MT2-7B#results结果

https://huggingface.co/tencent/Hy-MT2-7B#model-links模型链接

https://huggingface.co/tencent/Hy-MT2-7B#hy-mt2-translation-task-instruction-examples-chinese-english-comparisonHy-MT2 翻译任务指令示例（中英文对照）

https://huggingface.co/tencent/Hy-MT2-7B#inference-and-deployment推理与部署

https://huggingface.co/tencent/Hy-MT2-7B#transformerstransformers

加载 tokenizer

加载模型

示例推理

https://huggingface.co/tencent/Hy-MT2-7B#vllmvllm

https://huggingface.co/tencent/Hy-MT2-7B#sglangsglang

https://huggingface.co/tencent/Hy-MT2-7B#llama_cppllama_cpp

https://huggingface.co/tencent/Hy-MT2-7B#clone-llamacpp克隆 llama.cpp

https://huggingface.co/tencent/Hy-MT2-7B#enter-the-llamacpp-folder进入 llama.cpp 文件夹

https://huggingface.co/tencent/Hy-MT2-7B#build-llamacpp构建 llama.cpp

https://huggingface.co/tencent/Hy-MT2-7B#run-a-completion-example运行 completion 示例

https://huggingface.co/tencent/Hy-MT2-7B#run-the-llamacpp-benchmark运行 llama.cpp 基准测试

https://huggingface.co/tencent/Hy-MT2-7B#model-training模型训练

https://huggingface.co/tencent/Hy-MT2-7B#quantization-tool量化工具

https://huggingface.co/tencent/Hy-MT2-7B#supported-languages支持的语言

https://huggingface.co/tencent/Hy-MT2-7B#citing-hy-mt2引用 Hy-MT2

https://huggingface.co/tencent/Hy-MT2-7B#contact-us联系我们

相似文章

Hy-MT2：一套快速、高效且强大的多语言翻译模型，面向实际应用

@FeitengLi: Hy-MT2 新的开源多语言翻译模型 能力可以跟顶尖大模型匹配、支持 33 种语言之间的互译，而且具备灵活的 Instruct 能力，实现了 2-bit 量化不足 500MB 的空间占用 可以很好的跑在端侧 https://modelsc…

AngelSlim/Hy-MT1.5-1.8B-1.25bit

Tencent Hy-MT2 现已采用 Apache License 2.0

@FeitengLi: 99M 参数的 TTS 跑在 CPU 上，比 2B 大模型跑在 A100 上还快。 Supertone 新开源的 supertonic-3 ONNX Runtime，完全本地，浏览器能跑，手机能跑，树莓派也能跑。

提交意见反馈

@FeitengLi: Hy-MT2 新的开源多语言翻译模型能力可以跟顶尖大模型匹配、支持 33 种语言之间的互译，而且具备灵活的 Instruct 能力，实现了 2-bit 量化不足 500MB 的空间占用可以很好的跑在端侧 https://modelsc…