OCLGMLMar 4, 2018

A Distributed Quasi-Newton Algorithm for Empirical Risk Minimization with Nonsmooth Regularization

arXiv:1803.01370v229 citations
Originality Highly original
AI Analysis

This addresses the challenge of efficient distributed optimization for machine learning practitioners, offering a novel method for a known bottleneck in handling nonsmooth regularizers in distributed settings.

The paper tackles the problem of distributed optimization for empirical risk minimization with nonsmooth regularization by proposing a communication- and computation-efficient quasi-Newton algorithm, achieving global linear convergence for a broad range of non-strongly convex problems and demonstrating significant improvements in communication cost and running time over state-of-the-art methods.

We propose a communication- and computation-efficient distributed optimization algorithm using second-order information for solving ERM problems with a nonsmooth regularization term. Current second-order and quasi-Newton methods for this problem either do not work well in the distributed setting or work only for specific regularizers. Our algorithm uses successive quadratic approximations, and we describe how to maintain an approximation of the Hessian and solve subproblems efficiently in a distributed manner. The proposed method enjoys global linear convergence for a broad range of non-strongly convex problems that includes the most commonly used ERMs, thus requiring lower communication complexity. It also converges on non-convex problems, so has the potential to be used on applications such as deep learning. Initial computational results on convex problems demonstrate that our method significantly improves on communication cost and running time over the current state-of-the-art methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes