LGMLDec 9, 2024

Gentle Local Robustness implies Generalization

arXiv:2412.06381v24 citationsh-index: 14Mach learn
AI Analysis

This work provides improved theoretical guarantees for generalization in machine learning, particularly for deep neural networks, though it is incremental in refining existing bounds.

The paper addresses the issue that existing robustness-based generalization bounds are vacuous for the Bayes optimal classifier, and introduces novel model-dependent bounds that are provably tighter and converge to the true error as sample size increases.

Robustness and generalization ability of machine learning models are of utmost importance in various application domains. There is a wide interest in efficient ways to analyze those properties. One important direction is to analyze connection between those two properties. Prior theories suggest that a robust learning algorithm can produce trained models with a high generalization ability. However, we show in this work that the existing error bounds are vacuous for the Bayes optimal classifier which is the best among all measurable classifiers for a classification problem with overlapping classes. Those bounds cannot converge to the true error of this ideal classifier. This is undesirable, surprizing, and never known before. We then present a class of novel bounds, which are model-dependent and provably tighter than the existing robustness-based ones. Unlike prior ones, our bounds are guaranteed to converge to the true error of the best classifier, as the number of samples increases. We further provide an extensive experiment and find that two of our bounds are often non-vacuous for a large class of deep neural networks, pretrained from ImageNet.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes