HiGR: Industrial-Scale Hierarchical Generative Slate Recommendation Framework in Tencent
Summary
Tencent has developed HiGR, a hierarchical generative slate recommendation framework designed to enhance efficiency and quality in online platforms. Released on June 5, 2009, HiGR addresses issues in prior autoregressive models, such as entangled item tokenization and inefficient sequential decoding. It features a Contrastive Residual Quantization VAE (CRQ-VAE) for semantically structured item IDs, achieving a 2.37% collision rate and 66.47% consistency. The framework also employs a Hierarchical Slate Decoder (HSD) that separates list-level planning from item-level decoding for coarse-to-fine generation. Furthermore, HiGR integrates a reference-model-free Odds Ratio Preference Optimization (ORPO) for listwise preference alignment, optimizing for ranking fidelity, interest, and diversity. Offline evaluations on a large-scale commercial media platform showed over 10% improvement in recommendation quality and a 5x inference speedup. Online A/B tests demonstrated a 1.22% increase in Average Watch Time and a 1.73% increase in Average Video Views.
Key takeaway
For Machine Learning Engineers building or optimizing generative slate recommendation systems, HiGR offers a proven approach to overcome efficiency and quality bottlenecks. You should consider adopting its hierarchical planning and contrastive semantic ID generation to achieve significant inference speedups and improved user engagement metrics. Implement listwise preference alignment with ORPO to directly optimize for holistic slate quality, ensuring your models are both performant and user-centric.
Key insights
Hierarchical generative models with preference alignment significantly boost slate recommendation efficiency and quality.
Principles
- Decouple generation into global planning and specific item selection.
- Enforce semantic consistency in item IDs via contrastive learning.
- Optimize slate quality directly using listwise preference alignment.
Method
HiGR uses CRQ-VAE for structured item tokenization, then a Hierarchical Slate Decoder for coarse-to-fine generation. It applies ORPO for listwise preference alignment, optimizing ranking, interest, and diversity.
In practice
- Use CRQ-VAE for semantically consistent item ID generation.
- Implement hierarchical decoding for efficient slate planning.
- Apply ORPO for stable, reference-model-free preference optimization.
Topics
- Slate Recommendation
- Generative Recommendation
- Hierarchical Planning
- Preference Alignment
- Contrastive Learning
- Semantic IDs
Best for: MLOps Engineer, AI Engineer, AI Architect, AI Scientist, Machine Learning Engineer, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.