LGSTMLMay 22, 2020

Instability, Computational Efficiency and Statistical Accuracy

arXiv:2005.11411v221 citations
AI Analysis

This work addresses computational efficiency and statistical accuracy for researchers in statistics and machine learning, offering a general framework that is incremental but provides specific gains in algorithm analysis.

The paper tackles the problem of statistical accuracy in estimators defined as fixed points of data-dependent operators, analyzing how algorithm stability and convergence rate affect performance. It shows that unstable algorithms like Newton's method can achieve the same statistical accuracy as stable ones in exponentially fewer steps, reducing iterations from polynomial to logarithmic in sample size.

Many statistical estimators are defined as the fixed point of a data-dependent operator, with estimators based on minimizing a cost function being an important special case. The limiting performance of such estimators depends on the properties of the population-level operator in the idealized limit of infinitely many samples. We develop a general framework that yields bounds on statistical accuracy based on the interplay between the deterministic convergence rate of the algorithm at the population level, and its degree of (in)stability when applied to an empirical object based on $n$ samples. Using this framework, we analyze both stable forms of gradient descent and some higher-order and unstable algorithms, including Newton's method and its cubic-regularized variant, as well as the EM algorithm. We provide applications of our general results to several concrete classes of models, including Gaussian mixture estimation, non-linear regression models, and informative non-response models. We exhibit cases in which an unstable algorithm can achieve the same statistical accuracy as a stable algorithm in exponentially fewer steps -- namely, with the number of iterations being reduced from polynomial to logarithmic in sample size $n$.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes