LG OC ML | Dec 15, 2024

Optimal Rates for Robust Stochastic Convex Optimization

Changyu Gao, Andrew Lowy, Xingyu Zhou, Stephen J. Wright

arXiv:2412.11003v3 | 2.6 | h-index: 10 | FORC

Originality: Highly original

AI Analysis

This work addresses a fundamental open problem in robust machine learning for high-dimensional settings, providing optimal algorithms that improve over existing suboptimal methods and relax assumptions, with potential applications in domains vulnerable to adversarial data corruption.

The paper determines optimal rates for robust stochastic convex optimization under the ε-contamination model, in which an adversary can corrupt a fraction of the samples. It develops algorithms that achieve minimax-optimal excess risk up to logarithmic factors without requiring stringent assumptions such as Lipschitz continuity or smoothness of the individual sample functions.

Machine learning algorithms in high-dimensional settings are highly susceptible to the influence of even a small fraction of structured outliers, making robust optimization techniques essential. In particular, within the $ε$-contamination model, where an adversary can inspect and replace up to an $ε$-fraction of the samples, a fundamental open problem is determining the optimal rates for robust stochastic convex optimization (SCO) under such contamination. We develop novel algorithms that achieve minimax-optimal excess risk (up to logarithmic factors) under the $ε$-contamination model. Our approach improves over existing algorithms, which are not only suboptimal but also require stringent assumptions, including Lipschitz continuity and smoothness of individual sample functions. By contrast, our optimal algorithms do not require these stringent assumptions, assuming only population-level smoothness of the loss. Moreover, our algorithms can be adapted to handle the case in which the covariance parameter is unknown, and can be extended to nonsmooth population risks via convolutional smoothing. We complement our algorithmic developments with a tight information-theoretic lower bound for robust SCO.
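To make the $ε$-contamination model concrete, the sketch below simulates a simplified version of it and compares a plain sample mean with a coordinate-wise median. This is only an illustration and not the paper's algorithm: the `contaminate` helper, the fixed outlier value, the Gaussian data, and the choice of the coordinate-wise median as the robust estimator are all assumptions made here. The adversary in this sketch also picks samples at random, whereas the model in the abstract allows it to inspect the samples before replacing them.

```python
import numpy as np


def contaminate(samples, eps, outlier_value, rng):
    """Replace floor(eps * n) randomly chosen rows with a fixed outlier value.

    Hypothetical helper: a simplified, oblivious stand-in for the
    eps-contamination adversary, which in the real model may inspect the data.
    """
    n = samples.shape[0]
    k = int(np.floor(eps * n))
    corrupted = samples.copy()
    idx = rng.choice(n, size=k, replace=False)
    corrupted[idx] = outlier_value  # every coordinate of the chosen rows is overwritten
    return corrupted


rng = np.random.default_rng(0)
n, d, eps = 2000, 5, 0.1          # illustrative sizes, not taken from the paper
true_mean = np.zeros(d)

clean = rng.normal(loc=true_mean, scale=1.0, size=(n, d))
corrupted = contaminate(clean, eps, outlier_value=50.0, rng=rng)

# Error of the non-robust estimator (sample mean) on corrupted data.
mean_err = np.linalg.norm(corrupted.mean(axis=0) - true_mean)

# Error of a simple robust baseline (coordinate-wise median) on the same data.
median_err = np.linalg.norm(np.median(corrupted, axis=0) - true_mean)

print(f"sample mean error:       {mean_err:.3f}")
print(f"coordinate median error: {median_err:.3f}")
```

In this toy setting the sample mean is pulled far from the true mean by the outliers, while the median moves only slightly. The coordinate-wise median is used purely as a familiar baseline; it is not claimed to attain the optimal rates discussed in the abstract.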
