MLDIS-NNLGJan 31, 2022

Fluctuations, Bias, Variance & Ensemble of Learners: Exact Asymptotics for Convex Losses in High-Dimension

arXiv:2201.13383v128 citations
Originality Incremental advance
AI Analysis

This work addresses the fundamental issue of randomness and generalization in machine learning for researchers, offering a rigorous theoretical framework that is incremental but broad in scope.

The authors tackled the problem of understanding statistical fluctuations in ensembles of generalized linear models trained on correlated features in high dimensions, providing a complete asymptotic description of the joint distribution of empirical risk minimizers for convex losses and regularization, which helps analyze ensembling effects and the double-descent phenomenon.

From the sampling of data to the initialisation of parameters, randomness is ubiquitous in modern Machine Learning practice. Understanding the statistical fluctuations engendered by the different sources of randomness in prediction is therefore key to understanding robust generalisation. In this manuscript we develop a quantitative and rigorous theory for the study of fluctuations in an ensemble of generalised linear models trained on different, but correlated, features in high-dimensions. In particular, we provide a complete description of the asymptotic joint distribution of the empirical risk minimiser for generic convex loss and regularisation in the high-dimensional limit. Our result encompasses a rich set of classification and regression tasks, such as the lazy regime of overparametrised neural networks, or equivalently the random features approximation of kernels. While allowing to study directly the mitigating effect of ensembling (or bagging) on the bias-variance decomposition of the test error, our analysis also helps disentangle the contribution of statistical fluctuations, and the singular role played by the interpolation threshold that are at the roots of the "double-descent" phenomenon.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes