1S-DAug: One-Shot Data Augmentation for Robust Few-Shot Generalization
Summary
1S-DAug is a novel one-shot generative data augmentation operator designed to enhance few-shot learning (FSL) model generalization to novel classes, particularly at test time. Unlike traditional test-time augmentations that often lack effectiveness, 1S-DAug synthesizes diverse yet faithful image variants from a single example. This is achieved by combining geometric perturbations, controlled noise injection, and a denoising diffusion process conditioned on the original image. The generated images are then encoded and aggregated with the original image into a combined representation, leading to more robust FSL predictions. Implemented as a training-free, model-agnostic plugin, 1S-DAug consistently improves FSL performance across four different datasets, including a proportional accuracy improvement exceeding 10% on the miniImagenet 5-way-1-shot benchmark, without requiring any model parameter updates.
Key takeaway
For AI Engineers and Research Scientists working on few-shot learning, 1S-DAug offers a significant advantage by improving model generalization to unseen classes without requiring model retraining or parameter updates. You should consider integrating this training-free, model-agnostic plugin into your FSL pipelines, especially for applications like medical imaging or autonomous driving where data scarcity is common. This approach can yield substantial accuracy gains, as demonstrated by the >10% improvement on miniImagenet, by enhancing the diversity and robustness of test-time representations.
Key insights
1S-DAug improves few-shot learning generalization by generating diverse, faithful image variants at test time via diffusion.
Principles
- Diversity and accuracy are crucial for effective data augmentation.
- Test-time augmentation directly addresses novel class generalization.
- Feature aggregation enhances robustness for non-parametric classifiers.
Method
1S-DAug applies geometric transformations, injects controlled noise, and then denoises via an attention-guided diffusion process conditioned on the original image to synthesize variants for aggregation.
In practice
- Use 1S-DAug as a training-free plugin for FSL models.
- Adjust noise level to balance diversity and faithfulness.
- Aggregate features from original and augmented views for prediction.
Topics
- Few-Shot Learning
- Data Augmentation
- Diffusion Models
- Generative Augmentation
- Test-Time Augmentation
Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, Computer Vision Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.