Escaping Iterative Parameter-Space Noise: Differentially Private Learning with a Hypernetwork
Summary
DP-DeepSets, a novel differentially private (DP) learning framework, addresses the utility loss in gradient-based DP methods by avoiding iterative, high-dimensional noise injection. It employs a hypernetwork, trained on public datasets, to map a private dataset directly to target model parameters. The approach embeds each private example into a low-dimensional representation, aggregates these, and injects DP noise only once into this low-dimensional dataset embedding. This mechanism significantly reduces the adverse effects of noise. Theoretically, DP-DeepSets achieves higher utility than DP-SGD in a synthetic linear regression setting. Experimentally, it demonstrates lower FID scores for LoRA fine-tuning of diffusion models on CIFAR-10 with 128 private data points, outperforming DP-SGD and other public-data-guided approaches, especially at strict privacy budgets like ε=1.
Key takeaway
For Machine Learning Engineers or AI Scientists fine-tuning large models on sensitive, small private datasets, traditional DP-SGD incurs substantial utility loss. You should evaluate DP-DeepSets as an alternative. Its single, low-dimensional noise injection method can yield significantly higher model utility, particularly for LoRA fine-tuning of diffusion models, even under strict privacy budgets like ε=1. This approach offers a path to better performance where gradient-based DP methods struggle.
Key insights
Differentially private learning can achieve higher utility by injecting noise once into a low-dimensional dataset embedding via a public hypernetwork.
Principles
- Single, low-dimensional noise injection improves DP utility.
- Public hypernetworks can learn dataset-to-parameter mappings.
- DP noise scale is critical, especially for small datasets.
Method
DP-DeepSets embeds private data points, clips their L2-norm, averages them, adds Gaussian noise to this low-dimensional dataset embedding, then uses a Transformer-based network to generate target model parameters.
In practice
- Consider hypernetworks for DP fine-tuning on small private datasets.
- Train hypernetworks on large public datasets to learn "how to learn."
- Prioritize low-dimensional noise injection for better utility.
Topics
- Differential Privacy
- Hypernetworks
- LoRA Fine-tuning
- Diffusion Models
- Privacy-Preserving ML
- Gradient-Based Methods
Code references
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Security Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.