MLLGOCJun 9, 2021

From inexact optimization to learning via gradient concentration

arXiv:2106.05397v36 citations
AI Analysis

This work addresses the fundamental problem of generalization in machine learning for researchers and practitioners, providing theoretical insights into how optimization affects learning outcomes.

The paper tackles the gap between minimizing empirical training error and achieving low test error by combining gradient concentration with inexact optimization theory, deriving sharp test error guarantees and highlighting implicit regularization in unconstrained objectives.

Optimization in machine learning typically deals with the minimization of empirical objectives defined by training data. However, the ultimate goal of learning is to minimize the error on future data (test error), for which the training data provides only partial information. In this view, the optimization problems that are practically feasible are based on inexact quantities that are stochastic in nature. In this paper, we show how probabilistic results, specifically gradient concentration, can be combined with results from inexact optimization to derive sharp test error guarantees. By considering unconstrained objectives we highlight the implicit regularization properties of optimization for learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes