LGMLOct 20, 2022

Tighter PAC-Bayes Generalisation Bounds by Leveraging Example Difficulty

arXiv:2210.11289v110 citationsh-index: 21
Originality Incremental advance
AI Analysis

This work provides incremental improvements in generalization theory for machine learning practitioners, potentially leading to more reliable model evaluation.

The paper tackles the problem of obtaining tighter PAC-Bayesian generalization bounds by introducing a modified excess risk that uses example difficulty to reduce variance, resulting in improved bounds as demonstrated empirically on real-world datasets.

We introduce a modified version of the excess risk, which can be used to obtain tighter, fast-rate PAC-Bayesian generalisation bounds. This modified excess risk leverages information about the relative hardness of data examples to reduce the variance of its empirical counterpart, tightening the bound. We combine this with a new bound for $[-1, 1]$-valued (and potentially non-independent) signed losses, which is more favourable when they empirically have low variance around $0$. The primary new technical tool is a novel result for sequences of interdependent random vectors which may be of independent interest. We empirically evaluate these new bounds on a number of real-world datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes