Not All RecSys Problems Are Created Equal
Summary
Recommender systems (RecSys) vary significantly in complexity, with industry giants like TikTok and Spotify employing advanced deep learning models, while most practitioners utilize simpler, tabular models such as gradient-boosted trees. This distinction arises from differences in candidate generation, ranking complexity, and the nature of user preferences. Candidate generation can range from simple filtering, as seen with Booking.com, to complex machine learning for vast catalogs like Amazon. Ranking complexity depends on observable outcomes, catalog stability, and preference subjectivity. Businesses with directly observable outcomes and stable catalogs, like IKEA, can rely on strong baselines. Conversely, platforms with weak signals (Yelp) or high-churn catalogs (Zillow) require more sophisticated ML. Deep learning is primarily justified in domains with highly subjective preferences and dense behavioral data, such as YouTube or Spotify, where personalization offers immense value.
Key takeaway
For AI Architects evaluating recommender system solutions, understand that the "state-of-the-art" models from tech giants are tailored to extreme conditions. You should first map your problem's constraints regarding catalog stability, signal strength, and preference subjectivity. This will guide you to select the most appropriate and cost-effective solution, often a gradient-boosted tree model, rather than over-engineering with deep learning when simpler approaches suffice for your specific data and business needs.
Key insights
RecSys complexity is dictated by catalog stability, signal strength, and preference subjectivity, not universal architectural mandates.
Principles
- Strong baselines emerge from directly observable outcomes.
- Position bias distorts weak upper-funnel signals.
- Deep learning excels with subjective taste and dense data.
Method
Assess RecSys problem complexity by evaluating candidate generation needs, observable outcomes, catalog stability, and the subjectivity of user preferences to determine appropriate ML model complexity.
In practice
- Use GBDTs for stable catalogs and observable outcomes.
- Employ ML for high-churn catalogs or weak signals.
- Apply embeddings as features for GBDTs to personalize.
Topics
- Recommender Systems
- Gradient-Boosted Trees
- Deep Learning Models
- Candidate Generation
- Ranking Algorithms
Best for: AI Architect, AI Product Manager, Product Manager, Machine Learning Engineer, AI Engineer, Data Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Towards Data Science.