LGOCMLJun 24, 2020

Advances in Asynchronous Parallel and Distributed Optimization

arXiv:2006.13838v199 citations
Originality Synthesis-oriented
AI Analysis

This is an incremental review article summarizing existing developments in optimization methods for scalable machine learning applications.

The paper reviews recent advances in asynchronous parallel and distributed optimization methods for large-scale machine learning problems, highlighting their efficiency in resource use and robustness to issues like slow nodes and unreliable communication.

Motivated by large-scale optimization problems arising in the context of machine learning, there have been several advances in the study of asynchronous parallel and distributed optimization methods during the past decade. Asynchronous methods do not require all processors to maintain a consistent view of the optimization variables. Consequently, they generally can make more efficient use of computational resources than synchronous methods, and they are not sensitive to issues like stragglers (i.e., slow nodes) and unreliable communication links. Mathematical modeling of asynchronous methods involves proper accounting of information delays, which makes their analysis challenging. This article reviews recent developments in the design and analysis of asynchronous optimization methods, covering both centralized methods, where all processors update a master copy of the optimization variables, and decentralized methods, where each processor maintains a local copy of the variables. The analysis provides insights as to how the degree of asynchrony impacts convergence rates, especially in stochastic optimization methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes