Thinking Fast & Slow for a Personalized Notification System
Summary
Netflix has implemented a hierarchical personalized notification system, inspired by Daniel Kahneman's "Thinking, Fast and Slow," to manage its hundreds of millions of daily messages. This framework addresses the inherent conflict between optimizing for immediate engagement and mitigating long-term member fatigue or opt-out risk. The system comprises a "slow" policy that makes strategic, personalized decisions about a member's weekly messaging plan, including frequency per channel, offering approximately O(100) distinct cross-channel pacing strategies. Concurrently, a "fast" policy handles tactical, real-time message selection when send opportunities arise. This architecture, which uses a feature store to bridge communication, explicitly incorporates a utility function balancing positive engagement signals with a universal message cost to prevent over-messaging. The new system resulted in significant production metric lifts, particularly benefiting casual viewers, by enabling independent evolution of pacing strategies and content ranking.
Key takeaway
For AI Engineers or Product Managers designing personalized communication systems, adopting a hierarchical "slow" and "fast" policy architecture is crucial. This approach allows you to decouple long-term strategic pacing from real-time message selection, explicitly managing user fatigue and optimizing for sustained engagement. You should implement a feature store to enable asynchronous communication between policies, facilitating independent iteration and A/B testing of pacing strategies and content ranking models for significant metric lifts.
Key insights
A hierarchical "slow" and "fast" policy system effectively balances short-term engagement with long-term user experience in personalized notification delivery.
Principles
- Decouple strategic pacing from tactical message selection.
- Explicitly model long-term costs like user fatigue.
- Use a universal message cost to prevent over-messaging.
Method
The system uses a "slow" policy to define personalized weekly message pacing (O(100) combinations) by maximizing a utility function balancing engagement and message cost. A "fast" policy then executes real-time message selection within these strategic guardrails, communicating via a feature store.
In practice
- Implement a feature store for policy-to-policy communication.
- Tune a universal message cost parameter empirically.
- A/B test pacing strategies independently of ranking logic.
Topics
- Personalized Notifications
- Hierarchical AI Systems
- Dual-Process Theory
- Feature Stores
- User Engagement
- Notification Fatigue
Best for: AI Architect, AI Product Manager, Product Manager, Machine Learning Engineer, AI Engineer, MLOps Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Netflix TechBlog - Medium.