MLCRLGOct 18, 2024

Privacy for Free in the Overparameterized Regime

arXiv:2410.14787v22 citationsh-index: 25
Originality Highly original
AI Analysis

This addresses a key theoretical gap for practitioners in deep learning, challenging the belief that overparameterization inherently harms private learning performance.

The paper tackles the problem of performance degradation in differentially private gradient descent (DP-GD) for overparameterized models, showing that in the random features model with quadratic loss, privacy can be achieved for free with negligible excess risk, even in strongly private settings.

Differentially private gradient descent (DP-GD) is a popular algorithm to train deep learning models with provable guarantees on the privacy of the training data. In the last decade, the problem of understanding its performance cost with respect to standard GD has received remarkable attention from the research community, which formally derived upper bounds on the excess population risk $R_{P}$ in different learning settings. However, existing bounds typically degrade with over-parameterization, i.e., as the number of parameters $p$ gets larger than the number of training samples $n$ -- a regime which is ubiquitous in current deep-learning practice. As a result, the lack of theoretical insights leaves practitioners without clear guidance, leading some to reduce the effective number of trainable parameters to improve performance, while others use larger models to achieve better results through scale. In this work, we show that in the popular random features model with quadratic loss, for any sufficiently large $p$, privacy can be obtained for free, i.e., $\left|R_{P} \right| = o(1)$, not only when the privacy parameter $\varepsilon$ has constant order, but also in the strongly private setting $\varepsilon = o(1)$. This challenges the common wisdom that over-parameterization inherently hinders performance in private learning.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes