OCLGNANov 21, 2014

Randomized Dual Coordinate Ascent with Arbitrary Sampling

arXiv:1411.5873v158 citations
Originality Incremental advance
AI Analysis

This work addresses optimization challenges in large-scale machine learning by providing a flexible and efficient method for practitioners, though it is incremental as it builds on existing SDCA-like approaches.

The authors tackled the problem of minimizing the average of many smooth convex functions with a strongly convex regularizer by proposing Quartz, a primal-dual method that samples and updates random subsets of dual variables using arbitrary distributions, achieving bounds that match or exceed existing methods in serial, parallel, and distributed settings, with theoretical speedup factors accurately predicting practical performance.

We study the problem of minimizing the average of a large number of smooth convex functions penalized with a strongly convex regularizer. We propose and analyze a novel primal-dual method (Quartz) which at every iteration samples and updates a random subset of the dual variables, chosen according to an arbitrary distribution. In contrast to typical analysis, we directly bound the decrease of the primal-dual error (in expectation), without the need to first analyze the dual error. Depending on the choice of the sampling, we obtain efficient serial, parallel and distributed variants of the method. In the serial case, our bounds match the best known bounds for SDCA (both with uniform and importance sampling). With standard mini-batching, our bounds predict initial data-independent speedup as well as additional data-driven speedup which depends on spectral and sparsity properties of the data. We calculate theoretical speedup factors and find that they are excellent predictors of actual speedup in practice. Moreover, we illustrate that it is possible to design an efficient mini-batch importance sampling. The distributed variant of Quartz is the first distributed SDCA-like method with an analysis for non-separable data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes