OCLGMLOct 7, 2021

A Stochastic Newton Algorithm for Distributed Convex Optimization

arXiv:2110.02954v118 citations
Originality Incremental advance
AI Analysis

This addresses communication bottlenecks in distributed machine learning, offering an incremental improvement over existing methods.

The paper tackles the problem of reducing communication rounds in distributed convex optimization by proposing a stochastic Newton algorithm that uses stochastic Hessian-vector products, showing it can decrease communication frequency without performance loss, with convergence guarantees for quasi-self-concordant objectives like logistic regression.

We propose and analyze a stochastic Newton algorithm for homogeneous distributed stochastic convex optimization, where each machine can calculate stochastic gradients of the same population objective, as well as stochastic Hessian-vector products (products of an independent unbiased estimator of the Hessian of the population objective with arbitrary vectors), with many such stochastic computations performed between rounds of communication. We show that our method can reduce the number, and frequency, of required communication rounds compared to existing methods without hurting performance, by proving convergence guarantees for quasi-self-concordant objectives (e.g., logistic regression), alongside empirical evidence.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes