tencent/Hy-MT2-1.8B
Summary
Tencent open-sourced Hy-MT2, a family of multilingual translation models in sizes 1.8B, 7B, and 30B-A3B (MoE), supporting 33 languages and achieving state-of-the-art results, with the lightweight 1.8B model outperforming mainstream commercial APIs from Microsoft and Doubao.
View Cached Full Text
Cached at: 05/22/26, 07:43 AM
tencent/Hy-MT2-1.8B · Hugging Face
Source: https://huggingface.co/tencent/Hy-MT2-1.8B English |中文

🖥️Official Website| 💬GitHub| 🪡AngelSlim| 📚Hy-MT2 Report
https://huggingface.co/tencent/Hy-MT2-1.8B#model-introductionModel Introduction
Hy-MT2 is a family of “fast-thinking” multilingual translation models designed for complex real-world scenarios. It includes three model sizes: 1.8B, 7B, and 30B-A3B (MoE), all of which support translation among 33 languages and effectively follow translation instructions in multiple languages. For on-device deployment, AngelSlim 1.25-bit extreme quantization reduces the storage requirement of the 1.8B model to only 440 MB and improves inference speed by 1.5x. Multi-dimensional evaluations show that Hy-MT2 delivers outstanding performance across general, real-world business, domain-specific, and instruction-following translation tasks. The 7B and 30B-A3B models outperform open-source models such as DeepSeek-V4-Pro and Kimi K2.6 in fast-thinking mode, while the lightweight 1.8B model also surpasses mainstream commercial APIs from providers such as Microsoft and Doubao overall.
In this release, we also open-sourceIFMTBench, a benchmark for evaluating translation instruction-following capabilities.
We also welcome everyone to use our released Hy-MT2-Translator Skill, which makes it easy to integrate Hy-MT2 series models for translation tasks. Download links:ClawHubandSkillHub.
Now, Tencent Hy is officially partnering with WMT26 for the “Video Subtitle Translation Task” (https://www2.statmt.org/wmt26/video-subtitle-translation.html). Participants who use the Hy-MT model series to compete in the “General Machine Translation Task” (https://www2.statmt.org/wmt26/translation-task.html) and the “Video Subtitle Translation Task” will have the chance to win special awards sponsored by Hunyuan. We sincerely invite everyone to participate and jointly push the boundaries of machine translation technology!
https://huggingface.co/tencent/Hy-MT2-1.8B#newsNews
- 2026.5.21 We open-sourcedHy-MT2-1.8B/Hy-MT2-7B/Hy-MT2-30B-A3B/IFMTBenchon HuggingFace and ModelScope.
- 2025.12.30 We open-sourcedHY-MT1.5-1.8BandHY-MT1.5-7Bon HuggingFace and ModelScope.
- 2025.9.1 We open-sourcedHunyuan-MT-7BandHunyuan-MT-Chimera-7Bon HuggingFace and ModelScope.
https://huggingface.co/tencent/Hy-MT2-1.8B#resultsResults

For more experimental results and analysis, please refer to ourreport.
https://huggingface.co/tencent/Hy-MT2-1.8B#model-linksModel Links
Model NameDescriptionDownload LinkHy-MT2-1.8BHy 1.8B translation model🤗ModelHy-MT2-1.8B-FP8Hy 1.8B translation model, FP8 quantization🤗ModelHy-MT2-1.8B-GGUFHy 1.8B translation model, llama.cpp🤗ModelHy-MT2-1.8B-2bit-GGUFHy 1.8B translation model, llama.cpp, 2bit🤗ModelHy-MT2-1.8B-1.25bit-GGUFHy 1.8B translation model, llama.cpp, 1.25bit🤗ModelHy-MT2-7BHy 7B translation model🤗ModelHy-MT2-7B-FP8Hy 7B translation model, FP8 quantization🤗ModelHy-MT2-7B-GGUFHy 7B translation model, llama.cpp🤗ModelHy-MT2-30B-A3BHy 30B-A3B translation model🤗ModelHy-MT2-30B-A3B-FP8Hy 30B-A3B translation model, FP8 quantization🤗Model
https://huggingface.co/tencent/Hy-MT2-1.8B#hy-mt2-translation-task-instruction-examples-chinese-english-comparisonHy-MT2 Translation Task Instruction Examples (Chinese-English Comparison)
Note: In the following examples, both source_lang and target_lang should use the full language names. Chinese names should be used in Chinese prompts, and English names should be used in English prompts.
TypeChinese promptEnglish promptDefault Translation将以下文本翻译为\{target\_lang\},注意只需要输出翻译后的结果,不要额外解释:\{source\_text\}
Translate the following text into\{target\_lang\}. Note that you shouldonly output the translated result without any additional explanation:\{source\_text\}
Terminology参考下面的翻译:
\{text\}翻译成\{text\}
\{text\}翻译成\{text\}
\{text\}翻译成\{text\}
将以下文本翻译为\{target\_lang\},注意只需要输出翻译后的结果,不要额外解释:\{source\_text\}
Reference the following translations:
\{text\}translates to\{text\}
\{text\}translates to\{text\}
\{text\}translates to\{text\}Translate the following text into\{target\_lang\}. Note that you mustONLY output the translated result without any additional explanation:
\{source\_text\}
Style请将以下文本翻译为\{target\_lang\}。
注意翻译的风格要严格符合【**\{target\_style\}**】\{source\_text\}
Please translate the following text into\{target\_lang\}. Note that the translation style must strictly conform to [\{target\_style\}]:\{source\_text\}
Personalization*【待翻译文本】*
\{source\_text\}【翻译任务】 1、**\{user\_preferences\}2、\{user\_preferences\}** 3、…… 4、将【待翻译文本】翻译为\{target\_lang\}。
[Source Text]
\{source\_text\}[Translation Tasks] 1.\{user\_preferences\} 2.\{user\_preferences\} 3. ... 4. Translate the [Source Text] into\{target\_lang\}.
Delimiters请将以下文本准确翻译为\{target\_lang\}。
你必须在译文中保留等量的分隔符,绝对不可遗漏、转义或翻译该符号,并注意分隔符的位置。\{source\_text\}
Please accurately translate the following text into\{target\_lang\}.
You mustretain the exact same number of delimiters in the translation. Strictly do not omit, escape, or translate these symbols, and pay close attention to their placement.\{source\_text\}
Structured Data 1*# 任务目标*
将下方\{source\_text\}中的\{format\_type\}格式数据翻译为\{target\_lang\}。# 严格约束 1.结构锁定:绝对保持原有的\{format\_type\}数据结构、缩进和层级完全不变。 2.选择性翻译:仅翻译面向用户展示的可见文本内容。 3.禁止修改:严禁翻译或更改任何代码标签、键名 (Key)、变量占位符(如\{\{var\}\}、$\{var\}、%s、%d等)或代码属性。
# 数据输入 \{source\_text\}
### Task
Translate the user-facing text within the following\{format\_type\}data into\{target\_lang\}.### Strict Rules 1.**Structure Preservation:**You MUST preserve the original\{format\_type\}data structure, nesting, hierarchy, and indentation exactly as they are. 2.**Selective Translation:**Translate ONLY the visible, user-facing text content/values. 3.**Strict Non-Translation:**NEVER translate or alter code tags, keys, properties, object names, or variable placeholders. Leave them exactly in their original English/code form.
### Source Data \{source\_text\}
Structured Data 2*【背景信息】*
\{background\_text\}请结合背景信息将以下文本翻译为\{target\_lang\}。
【待翻译文本】 \{source\_text\}
[Background Information]
\{background\_text\}Please translate the following text into\{target\_lang\}, taking the provided background information into consideration.
[Source Text] \{source\_text\}
https://huggingface.co/tencent/Hy-MT2-1.8B#inference-and-deploymentInference and Deployment
For 1.8B and 7B, we recommend using the following parameters for inference. Note that our models do not have a default system_prompt.
{
"temperature": 0.7,
"top_p": 0.6,
"top_k": 20,
"repetition_penalty": 1.05,
"max_tokens": 4096
}
For 30B-A3B, we recommend using the following parameters for inference. Note that our models do not have a default system_prompt.
{
"temperature": 0.7,
"top_p": 1.0,
"top_k": -1,
"repetition_penalty": 1.0,
"max_tokens": 4096
}
https://huggingface.co/tencent/Hy-MT2-1.8B#transformerstransformers
transformers>=5.6.0
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch
model_path = "tencent/Hy-MT2-1.8B"
# Load tokenizer
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
# Load model
model = AutoModelForCausalLM.from_pretrained(
model_path,
dtype=torch.bfloat16,
device_map="auto",
trust_remote_code=True,
)
model.eval()
# Example inference
prompt = "将以下文本翻译成英语,注意只需要输出翻译后的结果,不要额外解释:\n\n今天天气真好。"
messages = [{"role": "user", "content": prompt}]
inputs = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt").to(model.device)
with torch.no_grad():
outputs = model.generate(
**inputs,
max_new_tokens=4096,
)
response = tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True)
print(response)
https://huggingface.co/tencent/Hy-MT2-1.8B#vllmvllm
Build vLLM from source:
uv venv --python 3.12 --seed --managed-python
source .venv/bin/activate
git clone https://github.com/vllm-project/vllm.git
cd vllm
uv pip install --editable . --torch-backend=auto
Start the vLLM server:
vllm serve tencent/Hy-MT2-1.8B --tensor-parallel-size 1
https://huggingface.co/tencent/Hy-MT2-1.8B#sglangsglang
Build SGLang from source:
git clone https://github.com/sgl-project/sglang
cd sglang
pip3 install pip --upgrade
pip3 install "transformers>=5.6.0"
pip3 install -e "python"
Launch SGLang server:
python3 -m sglang.launch_server --model tencent/Hy-MT2-1.8B --tp 1
https://huggingface.co/tencent/Hy-MT2-1.8B#model-trainingModel Training
Hy-MT2 provides a complete model training pipeline, supporting both full-parameter fine-tuning and LoRA fine-tuning, as well as multiple DeepSpeed ZeRO configurations and LLaMA-Factory integration.
For detailed training documentation, please refer to:Model Training Guide
https://huggingface.co/tencent/Hy-MT2-1.8B#quantization-toolQuantization Tool
We provideAngelSlim, an easy-to-use, comprehensive, and efficient large model compression toolkit covering common quantization algorithms, low-bit quantization, speculative sampling, and more.
https://huggingface.co/tencent/Hy-MT2-1.8B#supported-languagesSupported Languages
LanguagesAbbr.Chinese NamesChinesezh中文Englishen英语Frenchfr法语Portuguesept葡萄牙语Spanishes西班牙语Japaneseja日语Turkishtr土耳其语Russianru俄语Arabicar阿拉伯语Koreanko韩语Thaith泰语Italianit意大利语Germande德语Vietnamesevi越南语Malayms马来语Indonesianid印尼语Filipinotl菲律宾语Hindihi印地语Traditional Chinesezh-Hant繁体中文Polishpl波兰语Czechcs捷克语Dutchnl荷兰语Khmerkm高棉语Burmesemy缅甸语Persianfa波斯语Gujaratigu古吉拉特语Urduur乌尔都语Telugute泰卢固语Marathimr马拉地语Hebrewhe希伯来语Bengalibn孟加拉语Tamilta泰米尔语Ukrainianuk乌克兰语Tibetanbo藏语Kazakhkk哈萨克语Mongolianmn蒙古语Uyghurug维吾尔语Cantoneseyue粤语
https://huggingface.co/tencent/Hy-MT2-1.8B#citing-hy-mt2Citing Hy-MT2
@misc{zheng2026hymt2familyfastefficient,
title={Hy-MT2: A Family of Fast, Efficient and Powerful Multilingual Translation Models in the Wild},
author={Mao Zheng and Zheng Li and Tao Chen and Bo Lv and Mingrui Sun and Mingyang Song and Jinlong Song and Hong Huang and Decheng Wu and Hai Wang and Yifan Song and Yanfeng Chen and Guanwei Zhang},
year={2026},
eprint={2605.22064},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2605.22064},
}
https://huggingface.co/tencent/Hy-MT2-1.8B#contact-usContact Us
If you would like to leave feedback for our R&D and product teams, you are welcome to contact the Tencent Hunyuan LLM team. You can reach us by email at[email protected].
Similar Articles
MooreThreads/MusaCoder-27B • Huggingface
MooreThreads releases MusaCoder-27B, a 27-billion-parameter code generation model, accompanied by a paper on arXiv.
17 bugs in 10 weeks from AI security scanning
AI-based security scanning has discovered 17 bugs in Perfetto's trace processor over 10 weeks, highlighting the potential for AI to uncover vulnerabilities in long-tail code that previously received little attention.
Is This the Best Time Ever to Build AI Products?
The post discusses how falling barriers to entry in AI development, through free AI agent builders, open-source models, and no-code tools, are enabling solo founders to launch products faster than ever before.
How useful is qwopus compared to qwen3.6 27b
A user asks for community input on the practical usefulness of qwopus compared to qwen3.6 27b, particularly for agentic coding tasks, reporting mixed opinions and minimal personal differences in testing.
Show HN: macOS menu bar gauges for your Claude Code quota
A macOS menu bar utility that displays real-time Claude Code API quota usage via SwiftBar, supporting multiple accounts and showing utilization bars with countdowns when limits are reached.