LGFeb 24, 2025

On Traceability in $\ell_p$ Stochastic Convex Optimization

arXiv:2502.17384v21 citationsh-index: 27
Originality Highly original
AI Analysis

This work addresses the problem of balancing privacy and accuracy in machine learning for researchers and practitioners, offering theoretical insights into traceability and differential privacy, though it is incremental in building on existing privacy frameworks.

The paper investigates the necessity of traceability for accurate learning in stochastic convex optimization under ℓ_p geometries, establishing a fundamental tradeoff where below a certain excess risk threshold, every sample-efficient learner is traceable, with thresholds coinciding with differential privacy bounds for p in [1,2] and providing novel lower bounds for p > 2.

In this paper, we investigate the necessity of traceability for accurate learning in stochastic convex optimization (SCO) under $\ell_p$ geometries. Informally, we say a learning algorithm is $m$-traceable if, by analyzing its output, it is possible to identify at least $m$ of its training samples. Our main results uncover a fundamental tradeoff between traceability and excess risk in SCO. For every $p\in [1,\infty)$, we establish the existence of an excess risk threshold below which every sample-efficient learner is traceable with the number of samples which is a constant fraction of its training sample. For $p\in [1,2]$, this threshold coincides with the best excess risk of differentially private (DP) algorithms, i.e., above this threshold, there exist algorithms that are not traceable, which corresponds to a sharp phase transition. For $p \in (2,\infty)$, this threshold instead gives novel lower bounds for DP learning, partially closing an open problem in this setup. En route to establishing these results, we prove a sparse variant of the fingerprinting lemma, which is of independent interest to the community.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes