OCLGMLMay 24, 2018

Online Regularized Nonlinear Acceleration

arXiv:1805.09639v214 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of efficiently accelerating optimization algorithms for researchers and practitioners in machine learning and numerical analysis, representing an incremental improvement by extending RNA to faster algorithms.

The paper tackles the limitation of regularized nonlinear acceleration (RNA) in accelerating fast multistep algorithms like Nesterov's method, which often causes divergence, by adapting RNA to work online with such methods. The result shows optimal complexity bounds for quadratics and asymptotically optimal rates on general convex problems, with significant numerical performance improvements over classical accelerated methods.

Regularized nonlinear acceleration (RNA) estimates the minimum of a function by post-processing iterates from an algorithm such as the gradient method. It can be seen as a regularized version of Anderson acceleration, a classical acceleration scheme from numerical analysis. The new scheme provably improves the rate of convergence of fixed step gradient descent, and its empirical performance is comparable to that of quasi-Newton methods. However, RNA cannot accelerate faster multistep algorithms like Nesterov's method and often diverges in this context. Here, we adapt RNA to overcome these issues, so that our scheme can be used on fast algorithms such as gradient methods with momentum. We show optimal complexity bounds for quadratics and asymptotically optimal rates on general convex minimization problems. Moreover, this new scheme works online, i.e., extrapolated solution estimates can be reinjected at each iteration, significantly improving numerical performance over classical accelerated methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes