Optimal Rates for Pure {\varepsilon}-Differentially Private Stochastic Convex Optimization with Heavy Tails
Summary
A new study characterizes the minimax optimal excess-risk rate for pure ε-differentially private (DP) stochastic convex optimization (SCO) with heavy-tailed gradients, achieving this rate up to logarithmic factors. The research addresses a long-standing open problem in pure ε-DP, where prior work focused on approximate (ε,δ)-DP. The proposed algorithm achieves the optimal rate in polynomial time with high probability, and with probability 1 if the worst-case Lipschitz parameter is polynomially bounded. For specific structured problem classes, including hinge/ReLU-type and absolute-value losses on Euclidean balls, ellipsoids, and polytopes, the algorithm guarantees the same excess-risk in deterministic polynomial time, even with infinite worst-case Lipschitz parameters. This approach introduces a novel framework for privately optimizing Lipschitz extensions of the empirical loss and provides a new high-probability lower bound.
Key takeaway
For research scientists developing differentially private machine learning algorithms, this work provides a critical advancement for pure ε-DP SCO with heavy-tailed data. You should consider adopting the Lipschitz extension and double output perturbation framework to achieve minimax optimal excess risk rates, especially when dealing with unbounded gradient distributions where traditional Lipschitz assumptions are too restrictive or vacuous. This approach offers a computationally efficient path to strong privacy guarantees.
Key insights
Optimal pure ε-DP heavy-tailed SCO rates are achieved via Lipschitz extensions and double output perturbation.
Principles
- Bounded k-th moment assumption allows for heavy-tailed gradients.
- Lipschitz extensions reduce heavy-tailed ERM to Lipschitz ERM.
- Double output perturbation ensures privacy and accuracy.
Method
The method involves reducing heavy-tailed SCO to regularized ERM, optimizing Lipschitz extensions via a jointly convex reformulation and adaptive projected subgradient methods, and applying a double output perturbation for privacy.
In practice
- Use Lipschitz extensions for heavy-tailed gradient problems.
- Employ adaptive projected subgradient for unknown Lipschitz constants.
- Apply output perturbation for pure ε-DP guarantees.
Topics
- Pure ε-Differential Privacy
- Stochastic Convex Optimization
- Heavy-tailed Gradients
- Minimax Optimal Rates
- Lipschitz Extension
Best for: Research Scientist, AI Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.