ITLGMLFeb 3, 2021

Information-Theoretic Bounds on the Moments of the Generalization Error of Learning Algorithms

arXiv:2102.02016v219 citations
AI Analysis

This work addresses the fundamental problem of understanding and bounding the generalization error for machine learning researchers, offering a more detailed statistical characterization beyond just the expected value.

This paper provides a more refined analysis of machine learning model generalization behavior by characterizing bounds on the moments of their generalization error. It builds upon a new bound for the expected value of an arbitrary function of population and empirical risk.

Generalization error bounds are critical to understanding the performance of machine learning models. In this work, building upon a new bound of the expected value of an arbitrary function of the population and empirical risk of a learning algorithm, we offer a more refined analysis of the generalization behaviour of a machine learning models based on a characterization of (bounds) to their generalization error moments. We discuss how the proposed bounds -- which also encompass new bounds to the expected generalization error -- relate to existing bounds in the literature. We also discuss how the proposed generalization error moment bounds can be used to construct new generalization error high-probability bounds.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes