A Quantitative Characterization of Forgetting in Post-Training
Summary
Krishnakumar Balasubramanian and Shiva Prasad Kasiviswanathan's March 2026 paper, "A Quantitative Characterization of Forgetting in Post-Training," investigates the mechanisms behind forgetting in continually post-trained generative models. Building on a two-mode mixture abstraction from Chen et al. (2025), the authors distinguish two forms of forgetting: mass forgetting, where the old task's mixture weight collapses, and old-component drift, where an already-correct old component shifts. They prove that forward-KL objectives lead to mass forgetting, while reverse-KL objectives avoid it and instead incur old-component drift that decays exponentially with mode separation. The study also quantifies how replay interacts with each objective: under forward-KL, replay works by modifying the training distribution, whereas under reverse-KL it prevents old-mode starvation. Finally, the paper analyzes SDFT, TTT-Discover, and OAPL, deriving conditions for retaining old mass and controlling drift.
Key takeaway
For AI researchers developing continual learning strategies, the interplay between divergence objectives and replay is critical. The choice of forward-KL versus reverse-KL determines whether old knowledge is lost outright (mass forgetting) or merely drifts. Prioritize reverse-KL and strategic replay to preserve past learning, especially when fine-tuning generative models on new data.
Key insights
Forgetting in generative models is quantifiable, driven by divergence direction, geometric overlap, and sampling.
Principles
- Forward-KL objectives drive the old task's mixture weight to zero (mass forgetting).
- Reverse-KL objectives avoid mass forgetting.
- Under reverse-KL, old-component drift decays exponentially with mode separation.
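The first principle can be checked numerically on a toy problem. The sketch below is not the paper's construction. It assumes a one-dimensional model with two unit-variance Gaussian modes (old at -s/2, new at +s/2) and assumes the forward-KL target is the new-task distribution alone. The function name `forward_kl_to_new` and the parameters `w_old`, `separation`, and `n_grid` are hypothetical, chosen for illustration. Under these assumptions, the forward KL is smallest when the old mode's weight is zero and grows as old mass is retained.

```python
import numpy as np


def gauss(x, mean, std=1.0):
    """Density of a 1-D Gaussian evaluated at x."""
    return np.exp(-0.5 * ((x - mean) / std) ** 2) / (std * np.sqrt(2.0 * np.pi))


def forward_kl_to_new(w_old, separation=4.0, n_grid=20001):
    """Approximate KL(p_new || q) by a Riemann sum on a fixed grid.

    Toy assumptions (not taken from the paper):
      p_new = N(+separation/2, 1)                  new-task data only
      q     = w_old * N(-separation/2, 1)
              + (1 - w_old) * N(+separation/2, 1)  two-mode model
    """
    half = separation / 2.0
    x = np.linspace(-half - 10.0, half + 10.0, n_grid)
    dx = x[1] - x[0]
    p_new = gauss(x, half)
    q = w_old * gauss(x, -half) + (1.0 - w_old) * gauss(x, half)
    return float(np.sum(p_new * (np.log(p_new) - np.log(q))) * dx)


if __name__ == "__main__":
    # Sweep the weight kept on the old mode and report the objective value.
    for w_old in np.linspace(0.0, 1.0, 6):
        print(f"w_old={w_old:.1f}  forward KL={forward_kl_to_new(w_old):.4f}")
```

In this toy setting the objective increases monotonically with `w_old`, so an optimizer that minimizes it has no incentive to keep any mass on the old mode. The sketch does not model reverse-KL, replay, or component drift.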
Method
The paper formalizes forgetting using a two-mode mixture abstraction, distinguishing between mass forgetting and old-component drift, and analyzes divergence objectives and replay mechanisms.
In practice
- Use reverse-KL to mitigate mass forgetting.
- Employ replay to prevent old-mode starvation.
- Account for mode separation when controlling drift: under reverse-KL, drift decays exponentially as old and new modes move further apart.
Topics
- Continual Learning
- Generative Models
- Catastrophic Forgetting
- KL Divergence
- Replay Mechanisms
Best for: AI Researcher, AI Scientist, Research Scientist
Editorial summary, takeaway, and curation by AIssential. Original article published by Takara TLDR - Daily AI Papers.