STMLApr 17, 2020

Analyzing the discrepancy principle for kernelized spectral filter learning algorithms

arXiv:2004.08436v122 citations
AI Analysis

This work addresses a methodological challenge in iterative machine learning for researchers, providing incremental theoretical improvements in adaptive stopping rules.

The paper tackles the problem of determining optimal early stopping rules for kernelized spectral filter learning algorithms in nonparametric regression, showing that the classical discrepancy principle adapts to slow learning rates while smoothed versions adapt to faster rates.

We investigate the construction of early stopping rules in the nonparametric regression problem where iterative learning algorithms are used and the optimal iteration number is unknown. More precisely, we study the discrepancy principle, as well as modifications based on smoothed residuals, for kernelized spectral filter learning algorithms including gradient descent. Our main theoretical bounds are oracle inequalities established for the empirical estimation error (fixed design), and for the prediction error (random design). From these finite-sample bounds it follows that the classical discrepancy principle is statistically adaptive for slow rates occurring in the hard learning scenario, while the smoothed discrepancy principles are adaptive over ranges of faster rates (resp. higher smoothness parameters). Our approach relies on deviation inequalities for the stopping rules in the fixed design setting, combined with change-of-norm arguments to deal with the random design setting.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes