LGMLJan 9, 2022

Stability Based Generalization Bounds for Exponential Family Langevin Dynamics

arXiv:2201.03064v29 citations
Originality Incremental advance
AI Analysis

This work provides improved generalization bounds for machine learning practitioners using noisy optimization algorithms, though it is incremental as it builds on prior stability-based approaches.

The paper tackles the problem of deriving generalization bounds for noisy stochastic algorithms by introducing Exponential Family Langevin Dynamics (EFLD), a generalization of stochastic gradient Langevin dynamics, and establishes data-dependent bounds with O(1/n) sample dependence and gradient discrepancy dependence, resulting in non-vacuous and sharper bounds than existing ones, as shown in empirical benchmarks.

Recent years have seen advances in generalization bounds for noisy stochastic algorithms, especially stochastic gradient Langevin dynamics (SGLD) based on stability (Mou et al., 2018; Li et al., 2020) and information theoretic approaches (Xu and Raginsky, 2017; Negrea et al., 2019; Steinke and Zakynthinou, 2020). In this paper, we unify and substantially generalize stability based generalization bounds and make three technical contributions. First, we bound the generalization error in terms of expected (not uniform) stability which arguably leads to quantitatively sharper bounds. Second, as our main contribution, we introduce Exponential Family Langevin Dynamics (EFLD), a substantial generalization of SGLD, which includes noisy versions of Sign-SGD and quantized SGD as special cases. We establish data-dependent expected stability based generalization bounds for any EFLD algorithm with a O(1/n) sample dependence and dependence on gradient discrepancy rather than the norm of gradients, yielding significantly sharper bounds. Third, we establish optimization guarantees for special cases of EFLD. Further, empirical results on benchmarks illustrate that our bounds are non-vacuous, quantitatively sharper than existing bounds, and behave correctly under noisy labels.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes