Optimal Rates for Pure {\varepsilon}-Differentially Private Stochastic Convex Optimization with Heavy Tails

· Source: stat.ML updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cybersecurity & Data Privacy · Depth: Expert, long

Summary

A new study characterizes the minimax optimal excess-risk rate for pure ε-differentially private (DP) stochastic convex optimization (SCO) with heavy-tailed gradients, achieving this rate up to logarithmic factors. The research addresses a long-standing open problem in pure ε-DP, where prior work focused on approximate (ε,δ)-DP. The proposed algorithm achieves the optimal rate in polynomial time with high probability, and with probability 1 if the worst-case Lipschitz parameter is polynomially bounded. For specific structured problem classes, including hinge/ReLU-type and absolute-value losses on Euclidean balls, ellipsoids, and polytopes, the algorithm guarantees the same excess-risk in deterministic polynomial time, even with infinite worst-case Lipschitz parameters. This approach introduces a novel framework for privately optimizing Lipschitz extensions of the empirical loss and provides a new high-probability lower bound.

Key takeaway

For research scientists developing differentially private machine learning algorithms, this work provides a critical advancement for pure ε-DP SCO with heavy-tailed data. You should consider adopting the Lipschitz extension and double output perturbation framework to achieve minimax optimal excess risk rates, especially when dealing with unbounded gradient distributions where traditional Lipschitz assumptions are too restrictive or vacuous. This approach offers a computationally efficient path to strong privacy guarantees.

Key insights

Optimal pure ε-DP heavy-tailed SCO rates are achieved via Lipschitz extensions and double output perturbation.

Principles

Method

The method involves reducing heavy-tailed SCO to regularized ERM, optimizing Lipschitz extensions via a jointly convex reformulation and adaptive projected subgradient methods, and applying a double output perturbation for privacy.

In practice

Topics

Best for: Research Scientist, AI Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by stat.ML updates on arXiv.org.