ML LGMar 7, 2022

Fast Rates for Noisy Interpolation Require Rethinking the Effects of Inductive Bias

Konstantin Donhauser, Nicolo Ruggeri, Stefan Stojanovic, Fanny Yang

arXiv:2203.03597v219.425 citationsh-index: 17Has Code

Originality Highly original

AI Analysis

This work addresses a fundamental issue in high-dimensional machine learning by revealing a critical trade-off for practitioners using interpolating models with noise, though it is incremental as it builds on existing theory.

The paper investigates the trade-off between inductive bias and noise sensitivity in interpolating models, showing that stronger inductive biases can worsen noise effects. For sparse linear models, it proves that minimum ℓp-norm interpolators achieve fast polynomial rates close to 1/n for p > 1, compared to a logarithmic rate for p = 1.

Good generalization performance on high-dimensional data crucially hinges on a simple structure of the ground truth and a corresponding strong inductive bias of the estimator. Even though this intuition is valid for regularized models, in this paper we caution against a strong inductive bias for interpolation in the presence of noise: While a stronger inductive bias encourages a simpler structure that is more aligned with the ground truth, it also increases the detrimental effect of noise. Specifically, for both linear regression and classification with a sparse ground truth, we prove that minimum $\ell_p$-norm and maximum $\ell_p$-margin interpolators achieve fast polynomial rates close to order $1/n$ for $p > 1$ compared to a logarithmic rate for $p = 1$. Finally, we provide preliminary experimental evidence that this trade-off may also play a crucial role in understanding non-linear interpolating models used in practice.

View on arXiv PDF Code

Similar