Distill Once, Adapt Life-Long: Exploring Dataset Distillation for Continual Test-Time Adaptation
Summary
The DO-ALL (Distill Once, Adapt Life-Long) framework addresses challenges in Continual Test-Time Adaptation (CTTA), particularly when source datasets cannot be retained due to privacy or licensing. Existing source-free CTTA methods often suffer from instability, compounding self-training errors, and catastrophic forgetting under long-term distribution shifts. DO-ALL mitigates these issues by employing Dataset Distillation (DD) pre-deployment to generate a small set of synthetic "distilled anchors" that compactly summarize the source distribution. During adaptation, DO-ALL matches each target sample with its most semantically aligned anchor, providing a stable reference for CTTA through source replay, representation alignment, and manifold-smoothing regularization. This plug-and-play framework seamlessly integrates into existing CTTA algorithms, consistently enhancing long-term robustness across CIFAR100-C, ImageNet-C, and the CCC benchmark. The code is available at https://github.com/blue-531/DOALL.
Key takeaway
For Machine Learning Engineers deploying models in evolving environments with data privacy concerns, DO-ALL offers a robust solution to maintain performance. You should consider integrating Dataset Distillation to create compact source references, mitigating catastrophic forgetting and self-training errors in Continual Test-Time Adaptation. This approach allows your models to adapt continuously without retaining raw source data, enhancing long-term stability and compliance.
Key insights
DO-ALL uses Dataset Distillation to create stable, privacy-conscious source references for robust continual test-time adaptation.
Principles
- Compact source summaries stabilize adaptation.
- Semantic alignment improves reference utility.
- Replay and regularization counter forgetting.
Method
Before deployment, perform Dataset Distillation to create synthetic anchors. During adaptation, match target samples to anchors for stable reference via replay, alignment, and regularization.
In practice
- Integrate distilled anchors into existing CTTA.
- Use for privacy-sensitive model deployments.
- Apply to evolving image classification tasks.
Topics
- Continual Test-Time Adaptation
- Dataset Distillation
- Distribution Shift
- Catastrophic Forgetting
- Model Adaptation
- Computer Vision
Code references
Best for: Computer Vision Engineer, Research Scientist, AI Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Computer Vision and Pattern Recognition.