LG CR OCOct 24, 2024

Faster Algorithms for User-Level Private Stochastic Convex Optimization

arXiv:2410.18391v17.93 citationsh-index: 15NIPS

Originality Highly original

AI Analysis

This work addresses the challenge of efficiently protecting user privacy in large-scale machine learning applications, such as on mobile devices, by improving computational efficiency without stringent assumptions, representing a strong specific gain rather than a foundational breakthrough.

The paper tackles the problem of user-level differential privacy in stochastic convex optimization, where existing algorithms are impractical due to restrictive assumptions or high computational costs, and it provides novel algorithms with state-of-the-art excess risk and runtime guarantees, such as achieving optimal excess risk in approximately (mn)^{9/8} gradient computations for smooth losses.

We study private stochastic convex optimization (SCO) under user-level differential privacy (DP) constraints. In this setting, there are $n$ users (e.g., cell phones), each possessing $m$ data items (e.g., text messages), and we need to protect the privacy of each user's entire collection of data items. Existing algorithms for user-level DP SCO are impractical in many large-scale machine learning scenarios because: (i) they make restrictive assumptions on the smoothness parameter of the loss function and require the number of users to grow polynomially with the dimension of the parameter space; or (ii) they are prohibitively slow, requiring at least $(mn)^{3/2}$ gradient computations for smooth losses and $(mn)^3$ computations for non-smooth losses. To address these limitations, we provide novel user-level DP algorithms with state-of-the-art excess risk and runtime guarantees, without stringent assumptions. First, we develop a linear-time algorithm with state-of-the-art excess risk (for a non-trivial linear-time algorithm) under a mild smoothness assumption. Our second algorithm applies to arbitrary smooth losses and achieves optimal excess risk in $\approx (mn)^{9/8}$ gradient computations. Third, for non-smooth loss functions, we obtain optimal excess risk in $n^{11/8} m^{5/4}$ gradient computations. Moreover, our algorithms do not require the number of users to grow polynomially with the dimension.

View on arXiv PDF

Similar