LG | Feb 14, 2024

Information Complexity of Stochastic Convex Optimization: Applications to Generalization and Memorization

arXiv:2402.09327v2 | 13 citations | h-index: 27
AI Analysis

This work addresses an open theoretical question in machine learning about the role of memorization in learning, with implications for generalization bounds. It is incremental in that it builds on the existing CMI framework.

The paper studies memorization in stochastic convex optimization (SCO) by characterizing the tradeoff between a learning algorithm's accuracy and its conditional mutual information (CMI). It shows that every learner with excess error ε has CMI at least Ω(1/ε²) in the L² Lipschitz-bounded setting and at least Ω(1/ε) under strong convexity, and it constructs an adversary that can identify a significant fraction of the training samples in specific SCO problems.

In this work, we investigate the interplay between memorization and learning in the context of stochastic convex optimization (SCO). We define memorization via the information a learning algorithm reveals about its training data points. We then quantify this information using the framework of conditional mutual information (CMI) proposed by Steinke and Zakynthinou (2020). Our main result is a precise characterization of the tradeoff between the accuracy of a learning algorithm and its CMI, answering an open question posed by Livni (2023). We show that, in the $L^2$ Lipschitz-bounded setting and under strong convexity, every learner with an excess error $\varepsilon$ has CMI bounded below by $\Omega(1/\varepsilon^2)$ and $\Omega(1/\varepsilon)$, respectively. We further demonstrate the essential role of memorization in learning problems in SCO by designing an adversary capable of accurately identifying a significant fraction of the training samples in specific SCO problems. Finally, we enumerate several implications of our results, such as a limitation of generalization bounds based on CMI and the incompressibility of samples in SCO problems.
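
For readers unfamiliar with the CMI quantity the abstract refers to, the following is a sketch of the standard definition from Steinke and Zakynthinou (2020). The notation ($\tilde{Z}$, $U$, $S$, $\mathcal{A}$, $\mathcal{D}$, sample size $n$) is assumed here for illustration and does not appear on this page; the paper's own notation may differ.

```latex
% Supersample: \tilde{Z} \in \mathcal{Z}^{n \times 2}, entries drawn i.i.d. from the data distribution \mathcal{D}.
% Selector:    U = (U_1, \dots, U_n) uniform on \{0,1\}^n, independent of \tilde{Z}.
% Training set: S = (\tilde{Z}_{i,U_i})_{i=1}^{n}, i.e. one point chosen from each row of \tilde{Z}.
\[
  \mathrm{CMI}_{\mathcal{D}}(\mathcal{A})
  \;=\;
  I\bigl(\mathcal{A}(S);\, U \,\bigm|\, \tilde{Z}\bigr)
  \;\le\; n \log 2 .
\]
```

Read this way, CMI measures how much the learner's output reveals about which of the two candidate points in each row was actually used for training. The upper bound $n \log 2$ holds because $U$ carries only $n$ bits, which gives a sense of scale for the lower bounds $\Omega(1/\varepsilon^2)$ and $\Omega(1/\varepsilon)$ stated in the abstract.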

