On the Benefits of Accelerated Optimization in Robust and Private Estimation
This work addresses the need for efficient optimization in privacy-preserving and robust machine learning, offering incremental improvements in convergence rates for specific settings like linear regression.
The paper tackles the problem of improving statistical guarantees for robust and private estimation by applying accelerated gradient methods, such as Frank-Wolfe and projected gradient descent with Nesterov's momentum, to reduce iteration complexity and achieve optimal convergence rates in some scenarios.
We study the advantages of accelerated gradient methods, specifically based on the Frank-Wolfe method and projected gradient descent, for privacy and heavy-tailed robustness. Our approaches are as follows: For the Frank-Wolfe method, our technique is based on a tailored learning rate and a uniform lower bound on the gradient of the $\ell_2$-norm over the constraint set. For accelerating projected gradient descent, we use the popular variant based on Nesterov's momentum, and we optimize our objective over $\mathbb{R}^p$. These accelerations reduce iteration complexity, translating into stronger statistical guarantees for empirical and population risk minimization. Our analysis covers three settings: non-random data, random model-free data, and parametric models (linear regression and generalized linear models). Methodologically, we approach both privacy and robustness based on noisy gradients. We ensure differential privacy via the Gaussian mechanism and advanced composition, and we achieve heavy-tailed robustness using a geometric median-of-means estimator, which also sharpens the dependency on the dimension of the covariates. Finally, we compare our rates to existing bounds and identify scenarios where our methods attain optimal convergence.