FlowTime: Towards Continuous Generative Watch Time Prediction via Flow-based Personalized Priors
Summary
FlowTime introduces a novel Continuous Generative Regression paradigm for watch time prediction (WTP) in short-video recommender systems, addressing limitations of existing methods like Direct Regression's mean-collapse and Ordinal Regression's quantization errors. Current approaches also fail to capture the intrinsic multimodality and heterogeneity of User-Item Interaction Patterns. FlowTime employs a One-step Generative Variational Autoencoder to avoid iterative denoising latency and designs a Flow-based Personalized Prior using Normalizing Flows to adaptively model complex, history-conditioned multimodal patterns. The method revisits WTP from a causal perspective, identifying user-specific patterns as structural confounders. The authors also released TimeRec, the first open-source WTP library. Extensive offline experiments and online A/B tests confirm FlowTime's significant superiority over state-of-the-art methods.
Key takeaway
For AI Scientists and Machine Learning Engineers optimizing user engagement in short-video recommender systems, FlowTime offers a robust solution to overcome the limitations of traditional watch time prediction. You should explore its Continuous Generative Regression paradigm and Flow-based Personalized Priors to better capture multimodal user-item interaction patterns. Consider integrating the open-source TimeRec library to benchmark and implement these advanced prediction capabilities, potentially leading to more accurate engagement metrics and improved system performance.
Key insights
FlowTime leverages continuous generative regression and personalized priors to enhance watch time prediction accuracy and address data multimodality.
Principles
- User-specific patterns act as structural confounders in watch time prediction.
- Multimodality and heterogeneity are critical challenges in user-item interaction modeling.
- Continuous Generative Regression offers a new paradigm for watch time prediction.
Method
FlowTime utilizes a One-step Generative Variational Autoencoder and a Flow-based Personalized Prior, which employs Normalizing Flows to warp a standard Gaussian prior into a history-conditioned manifold.
In practice
- Adopt Flow-based Personalized Priors for adaptive multimodal pattern modeling.
- Consider Continuous Generative Regression for improved WTP expressivity.
- Utilize the TimeRec library for WTP benchmarking and development.
Topics
- Watch Time Prediction
- Recommender Systems
- Generative Models
- Variational Autoencoders
- Normalizing Flows
- User Engagement
- TimeRec Library
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.