LGCROCOct 24, 2024

Faster Algorithms for User-Level Private Stochastic Convex Optimization

arXiv:2410.18391v13 citationsh-index: 15NIPS
Originality Highly original
AI Analysis

This work addresses the challenge of efficiently protecting user privacy in large-scale machine learning applications, such as on mobile devices, by improving computational efficiency without stringent assumptions, representing a strong specific gain rather than a foundational breakthrough.

The paper tackles the problem of user-level differential privacy in stochastic convex optimization, where existing algorithms are impractical due to restrictive assumptions or high computational costs, and it provides novel algorithms with state-of-the-art excess risk and runtime guarantees, such as achieving optimal excess risk in approximately (mn)^{9/8} gradient computations for smooth losses.

We study private stochastic convex optimization (SCO) under user-level differential privacy (DP) constraints. In this setting, there are $n$ users (e.g., cell phones), each possessing $m$ data items (e.g., text messages), and we need to protect the privacy of each user's entire collection of data items. Existing algorithms for user-level DP SCO are impractical in many large-scale machine learning scenarios because: (i) they make restrictive assumptions on the smoothness parameter of the loss function and require the number of users to grow polynomially with the dimension of the parameter space; or (ii) they are prohibitively slow, requiring at least $(mn)^{3/2}$ gradient computations for smooth losses and $(mn)^3$ computations for non-smooth losses. To address these limitations, we provide novel user-level DP algorithms with state-of-the-art excess risk and runtime guarantees, without stringent assumptions. First, we develop a linear-time algorithm with state-of-the-art excess risk (for a non-trivial linear-time algorithm) under a mild smoothness assumption. Our second algorithm applies to arbitrary smooth losses and achieves optimal excess risk in $\approx (mn)^{9/8}$ gradient computations. Third, for non-smooth loss functions, we obtain optimal excess risk in $n^{11/8} m^{5/4}$ gradient computations. Moreover, our algorithms do not require the number of users to grow polynomially with the dimension.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes