LGITMLMay 12, 2019

A New Look at an Old Problem: A Universal Learning Approach to Linear Regression

arXiv:1905.04708v136 citations
Originality Synthesis-oriented
AI Analysis

This work addresses generalization challenges in over-parameterized linear regression for statistical learning, presenting an incremental analysis using universal learning principles.

The paper tackles the problem of linear regression in over-parameterized settings where the number of parameters exceeds training samples, showing that generalization is possible if test vectors align with dominant eigenvectors of the training data's correlation matrix, as demonstrated via polynomial fitting simulations.

Linear regression is a classical paradigm in statistics. A new look at it is provided via the lens of universal learning. In applying universal learning to linear regression the hypotheses class represents the label $y\in {\cal R}$ as a linear combination of the feature vector $x^Tθ$ where $x\in {\cal R}^M$, within a Gaussian error. The Predictive Normalized Maximum Likelihood (pNML) solution for universal learning of individual data can be expressed analytically in this case, as well as its associated learnability measure. Interestingly, the situation where the number of parameters $M$ may even be larger than the number of training samples $N$ can be examined. As expected, in this case learnability cannot be attained in every situation; nevertheless, if the test vector resides mostly in a subspace spanned by the eigenvectors associated with the large eigenvalues of the empirical correlation matrix of the training data, linear regression can generalize despite the fact that it uses an ``over-parametrized'' model. We demonstrate the results with a simulation of fitting a polynomial to data with a possibly large polynomial degree.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes