Tencent uses product rollout, not just benchmarks, to define Hy3 preview

· Source: Dataconomy · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, Robotics & Autonomous Systems · Depth: Advanced, quick

Summary

Tencent has launched Hy3 preview, a substantial fast-and-slow-thinking fused Mixture-of-Experts (MoE) language model featuring 295 billion total parameters and 21 billion activated parameters, supporting up to 256K context. The company claims a 40% improvement in inference efficiency and strong performance across reasoning, instruction following, in-context learning, coding, and agentic tasks, validated across over 50 evaluation sets. Crucially, Tencent is emphasizing real-world product integration, having already embedded Hy3 preview into Yuanbao, CodeBuddy, WorkBuddy, ima, Tencent Docs, and Peacekeeper Elite. This integration has reportedly reduced first-token latency by 54%, cut end-to-end duration by 47%, and achieved task success rates over 99.99% in applications like CodeBuddy and WorkBuddy. The model also supports complex agent workflows up to 495 steps in real user environments, with pricing starting at RMB 1.2 per million input tokens and RMB 4 per million output tokens via TokenHub.

Key takeaway

For AI Architects evaluating new large language models, prioritize vendors demonstrating concrete product performance and real-world integration metrics over abstract benchmark claims. Your teams should consider models like Tencent's Hy3 preview that offer tangible improvements in latency, task success rates, and complex agent workflow stability within existing applications, as this indicates practical deployability and value beyond lab evaluations. Leverage free trial periods to assess real-world fit.

Key insights

Tencent's Hy3 preview emphasizes real-world product integration and performance over isolated benchmark claims.

Principles

Method

Tencent employs product co-design and open-source feedback to refine Hy3 preview's performance in live scenarios, rebuilding its Hunyuan model line.

In practice

Topics

Best for: CTO, VP of Engineering/Data, AI Architect, AI Engineer, Machine Learning Engineer, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Dataconomy.