OCDSNAMLDec 30, 2015

Even Faster Accelerated Coordinate Descent Using Non-Uniform Sampling

arXiv:1512.09103v3180 citations
Originality Highly original
AI Analysis

This work provides a significant speed-up for optimization problems in machine learning and large-scale systems, though it is incremental as it builds on existing accelerated coordinate descent methods.

The paper tackles the problem of improving the running time of accelerated coordinate descent by introducing a novel non-uniform sampling method that selects coordinates based on the square root of their smoothness parameters, achieving a speed-up factor of up to √n. This improvement applies to key problems like empirical risk minimization and solving linear systems, with practical benefits demonstrated.

Accelerated coordinate descent is widely used in optimization due to its cheap per-iteration cost and scalability to large-scale problems. Up to a primal-dual transformation, it is also the same as accelerated stochastic gradient descent that is one of the central methods used in machine learning. In this paper, we improve the best known running time of accelerated coordinate descent by a factor up to $\sqrt{n}$. Our improvement is based on a clean, novel non-uniform sampling that selects each coordinate with a probability proportional to the square root of its smoothness parameter. Our proof technique also deviates from the classical estimation sequence technique used in prior work. Our speed-up applies to important problems such as empirical risk minimization and solving linear systems, both in theory and in practice.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes