FlashbackCL: Mitigating Temporal Forgetting in Federated Learning
Summary
Flashback Continual Learning (FlashbackCL) is a new Federated Learning (FL) method designed to address temporal forgetting in deployments where client data distributions drift over time. It extends Flashback, a recent FL method, which falters under temporal shifts due to miscalibrated per-class label counts. FlashbackCL integrates three key components: temporally-decayed label counts, a device-aware replay buffer utilizing Class-Balanced Reservoir Sampling (CBRS), and server-side active coreset curation on a public distillation set. Evaluations on CIFAR-10 with 50 clients and three temporal shift modes demonstrate FlashbackCL's effectiveness, achieving a 6.9% to 10.0% relative improvement over Flashback and reducing temporal forgetting by up to 68%. Ablation studies pinpoint CBRS replay as the critical component. FlashbackCL also improves Flashback by 3.5 points on stationary CIFAR-100, indicating its utility for spatial heterogeneity.
Key takeaway
For Machine Learning Engineers deploying Federated Learning systems where client data distributions drift over time, you should integrate FlashbackCL's components to mitigate temporal forgetting. Specifically, prioritize implementing Class-Balanced Reservoir Sampling (CBRS) replay buffers, as this is the critical component for reducing forgetting by up to 68%. Additionally, consider incorporating temporally-decayed label counts and server-side active coreset curation to further enhance model robustness against evolving data.
Key insights
FlashbackCL mitigates temporal forgetting in Federated Learning by combining decayed label counts, class-balanced replay, and server-side coreset curation.
Principles
- Temporal distribution shifts miscalibrate static knowledge proxies.
- Class-balanced replay buffers are critical for temporal forgetting.
- Server-side coreset curation enhances FL robustness.
Method
FlashbackCL extends Flashback by using temporally-decayed label counts, a device-aware replay buffer with Class-Balanced Reservoir Sampling (CBRS), and server-side active coreset curation.
In practice
- Implement CBRS replay to counter temporal data drift.
- Decay label counts to adapt to evolving client distributions.
- Use server-side coreset curation for public distillation sets.
Topics
- Federated Learning
- Continual Learning
- Temporal Forgetting
- Class-Balanced Reservoir Sampling
- Data Distribution Shift
- Replay Buffers
Best for: Research Scientist, AI Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.