MLLGJun 12, 2015

Adaptive Stochastic Primal-Dual Coordinate Descent for Separable Saddle Point Problems

arXiv:1506.04093v119 citations
Originality Incremental advance
AI Analysis

This incremental improvement addresses large-scale optimization problems in machine learning, such as regularized empirical risk minimization, by enhancing efficiency and parallelizability.

The authors tackled separable convex-concave saddle point problems common in machine learning by proposing an adaptive stochastic primal-dual coordinate descent method, which theoretically achieves a sharper linear convergence rate and performs comparably or better than state-of-the-art methods on synthetic and real-world datasets.

We consider a generic convex-concave saddle point problem with separable structure, a form that covers a wide-ranged machine learning applications. Under this problem structure, we follow the framework of primal-dual updates for saddle point problems, and incorporate stochastic block coordinate descent with adaptive stepsize into this framework. We theoretically show that our proposal of adaptive stepsize potentially achieves a sharper linear convergence rate compared with the existing methods. Additionally, since we can select "mini-batch" of block coordinates to update, our method is also amenable to parallel processing for large-scale data. We apply the proposed method to regularized empirical risk minimization and show that it performs comparably or, more often, better than state-of-the-art methods on both synthetic and real-world data sets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes