A Foundation Model for Generative Recommendation, A Unified Definition of Hallucination, and More!
Summary
This week's newsletter from RecSys highlights ten recent research advancements in information retrieval and recommender systems. Key developments include Kuaishou's "OpenOneRec," a foundation model for generative recommendation, and a unified definition of hallucination from Liu et al. A collaboration between PayPal and NVIDIA focuses on fine-tuning small language models for e-commerce agent optimization using NVIDIA's NeMo framework. Alibaba explores enhancing item-to-item recommendations with LLM-based data generation, while Tencent addresses inference bottlenecks in generative slate recommendation through hierarchical planning. Other research covers evaluating factuality in explainable recommenders, selective regularization for LLM knowledge integration, efficient billion-scale Approximate Nearest Neighbor Search (ANNS), instruction-following generative recommendation from JD.com, and Google Research's work on teaching LLMs to admit ignorance.
Key takeaway
For AI engineers developing recommender systems, these advancements indicate a strong trend towards integrating LLMs for both generation and efficiency. You should explore frameworks like NVIDIA NeMo for fine-tuning small language models and consider hierarchical planning to overcome inference bottlenecks in generative recommendation. Prioritize research into factuality and hallucination to build more reliable and explainable systems.
Key insights
Recent research advances generative recommendation, LLM integration, and efficiency in information retrieval systems.
Principles
- LLMs can enhance recommendation data generation.
- Hierarchical planning improves generative slate recommendation.
- Defining hallucination is crucial for LLM reliability.
Method
Methods include fine-tuning small language models for e-commerce agents, using LLM-based data generation for item-to-item recommendations, and applying hierarchical planning for generative slate recommendation.
In practice
- Utilize NVIDIA's NeMo for e-commerce agent optimization.
- Employ LLMs to generate synthetic data for recommenders.
- Implement hierarchical planning for efficient slate generation.
Topics
- Generative Recommendation
- Large Language Models
- E-commerce AI
- Approximate Nearest Neighbor Search
- LLM Hallucination
Best for: AI Engineer, NLP Engineer, Research Scientist, AI Researcher, AI Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Top Information Retrieval Papers of the Week.