Tencent Researchers Release Tencent HY-MT1.5: A New Translation Models Featuring 1.8B and 7B Models Designed for Seamless on-Device and Cloud Deployment
Summary
Tencent Hunyuan researchers have released HY-MT1.5, a new multilingual machine translation family comprising two models: HY-MT1.5-1.8B and HY-MT1.5-7B. These models are designed for both mobile devices and cloud systems, supporting mutual translation across 33 languages with 5 ethnic and dialect variations. Available on GitHub and Hugging Face with open weights, HY-MT1.5-7B is an upgraded version of the WMT25 championship system, optimized for explanatory and mixed-language scenarios, and includes native support for terminology intervention, contextual translation, and formatted translation. The compact HY-MT1.5-1.8B, with less than one-third the parameters, delivers comparable performance and, after quantization, can run on edge devices with approximately 1 GB of memory, achieving an average response time of 0.18 seconds for 50-token Chinese inputs while surpassing mainstream commercial APIs in quality. Both models are trained using a holistic, multi-stage pipeline that includes general pre-training, MT-oriented pre-training, supervised fine-tuning, on-policy distillation from 7B to 1.8B, and reinforcement learning with rubrics-based evaluation.
Key takeaway
For AI Engineers and Machine Learning Engineers deploying machine translation, HY-MT1.5 offers a compelling open-weight solution. Your teams can leverage the HY-MT1.5-1.8B model for efficient edge deployment on devices with 1 GB memory, achieving high quality and low latency. Consider integrating its prompt-driven features for terminology control, context-aware translation, and format preservation to enhance production system robustness and accuracy.
Key insights
Tencent's HY-MT1.5 offers high-quality, open-weight machine translation models optimized for diverse deployment scenarios.
Principles
- Distillation enables smaller models to inherit larger model behavior.
- Rubrics-based RL fine-tuning improves translation quality.
- Prompt-driven features enhance production system utility.
Method
A multi-stage pipeline combines general and MT-oriented pre-training, supervised fine-tuning, on-policy distillation, and reinforcement learning with rubrics-based human evaluation to optimize translation models.
In practice
- Use HY-MT1.5-1.8B for on-device, low-latency translation.
- Employ prompt templates for terminology, context, and format control.
- Utilize FP8 or Int4 quantization for memory-constrained deployment.
Topics
- Multilingual Machine Translation
- On-Device AI
- Model Distillation
- Reinforcement Learning
- Prompt Engineering
Best for: AI Architect, CTO, VP of Engineering/Data, AI Engineer, Machine Learning Engineer, NLP Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by MarkTechPost.