LGSTMLFeb 12, 2022

Towards Data-Algorithm Dependent Generalization: a Case Study on Overparameterized Linear Regression

arXiv:2202.06054v43 citations
Originality Incremental advance
AI Analysis

This addresses a major open problem in machine learning for researchers studying overparameterized models, though it is incremental as it builds on existing work with a new analytical approach.

The paper tackles the problem of characterizing generalization in overparameterized linear regression by analyzing the interplay between data distribution and training algorithms, showing that early stopping can enable generalization under weaker conditions than previous methods.

One of the major open problems in machine learning is to characterize generalization in the overparameterized regime, where most traditional generalization bounds become inconsistent even for overparameterized linear regression. In many scenarios, this failure can be attributed to obscuring the crucial interplay between the training algorithm and the underlying data distribution. This paper demonstrate that the generalization behavior of overparameterized model should be analyzed in a both data-relevant and algorithm-relevant manner. To make a formal characterization, We introduce a notion called data-algorithm compatibility, which considers the generalization behavior of the entire data-dependent training trajectory, instead of traditional last-iterate analysis. We validate our claim by studying the setting of solving overparameterized linear regression with gradient descent. Specifically, we perform a data-dependent trajectory analysis and derive a sufficient condition for compatibility in such a setting. Our theoretical results demonstrate that if we take early stopping iterates into consideration, generalization can hold with significantly weaker restrictions on the problem instance than the previous last-iterate analysis.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes