MLITLGFeb 5, 2023

Tighter Information-Theoretic Generalization Bounds from Supersamples

arXiv:2302.02432v326 citationsh-index: 32
AI Analysis

This work provides incremental improvements in generalization bounds for machine learning theory, addressing a specific theoretical bottleneck.

The authors tackled the problem of deriving tighter information-theoretic generalization bounds for learning algorithms in the supersample setting, achieving bounds that are theoretically or empirically tighter than all previous ones in this context.

In this work, we present a variety of novel information-theoretic generalization bounds for learning algorithms, from the supersample setting of Steinke & Zakynthinou (2020)-the setting of the "conditional mutual information" framework. Our development exploits projecting the loss pair (obtained from a training instance and a testing instance) down to a single number and correlating loss values with a Rademacher sequence (and its shifted variants). The presented bounds include square-root bounds, fast-rate bounds, including those based on variance and sharpness, and bounds for interpolating algorithms etc. We show theoretically or empirically that these bounds are tighter than all information-theoretic bounds known to date on the same supersample setting.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes