Differentially Private Model Merging

· Source: stat.ML updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cybersecurity & Data Privacy · Depth: Expert, extended

Summary

This work introduces two post-processing techniques, random selection (RS) and linear combination (LC), for differentially private model merging. The goal is to generate models that satisfy varying target differential privacy (DP) requirements without additional training, given a set of existing models trained on the same dataset with different privacy/utility tradeoffs. The authors provide privacy accounting for both approaches using Rényi DP and privacy loss distributions. A case study on private mean estimation theoretically establishes the superiority of linear combination over random selection. Empirical validation on synthetic and real-world datasets, including MNIST and CIFAR-10, demonstrates that both RS and LC improve privacy/utility tradeoffs compared to naive privacy accounting. The linear combination method requires structural assumptions on the training algorithms, such as DP-SGD, to achieve meaningful privacy gains beyond joint release.

Key takeaway

For research scientists developing machine learning applications with evolving privacy needs, consider implementing differentially private model merging techniques like Random Selection or Linear Combination. These methods allow you to adapt existing private models to new privacy constraints efficiently, avoiding costly retraining. Prioritize Linear Combination if your base models were trained with DP-SGD, as it offers tighter privacy bounds and better utility. This approach can significantly streamline model deployment and compliance with dynamic regulations.

Key insights

Merge existing differentially private models to meet dynamic privacy requirements without retraining.

Principles

Method

Two data-independent post-processing techniques: random selection (RS) and linear combination (LC). RS samples a model based on a probability distribution; LC linearly averages models. Both optimize mixing coefficients for utility under privacy constraints.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, AI Security Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.