From RNNs to Transformers: Building Sequential Recommenders (Part 1)
Summary
This article, "The Generative Recommender Revolution, Part 1," traces the evolution of sequential recommendation systems, highlighting the shift from traditional classification-based approaches to generative models. It begins by formalizing the sequential problem, where the order of user interactions (e.g., [i₁, i₂, i₃. .., iₙ]) is crucial for predicting the next item (iₙ₊₁), unlike collaborative filtering which treats interactions as an unordered bag. The post uses the Steam Games dataset, comprising 56,808 users, 6,382 games, and 2,235,453 interactions after 5-core filtering, as a benchmark. It then details GRU4Rec (2016), the first major work applying Recurrent Neural Networks (RNNs) to session-based recommendations, which significantly influenced the field for two years.
Key takeaway
For AI Architects and Machine Learning Engineers designing recommendation systems, understanding the transition from traditional collaborative filtering to sequential and generative models is critical. Your current pipeline, which might involve multi-stage retrieval and ranking, could be simplified and improved by exploring generative recommenders that directly predict the next item. Consider evaluating GRU4Rec as a foundational sequential model, especially for session-based contexts, to enhance prediction accuracy by accounting for interaction order.
Key insights
Sequential recommendation models predict the next item by considering the order of past user interactions.
Principles
- Order of interactions matters for next-item prediction.
- Generative models can subsume classification tasks.
Method
Sequential recommendation models predict iₙ₊₁ given [i₁, i₂, i₃. .., iₙ], applicable across e-commerce, streaming, music, and news domains.
In practice
- Use 5-core filtering for RecSys datasets.
- Apply GRU4Rec for session-based recommendations.
Topics
- Sequential Recommendation
- Generative Recommenders
- GRU4Rec
- SASRec
- BERT4Rec
Best for: Machine Learning Engineer, AI Engineer, AI Architect
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by MLWhiz: Recs|ML|GenAI.