LGOCMLJan 24, 2019

SAGA with Arbitrary Sampling

arXiv:1901.08669v127 citations
Originality Incremental advance
AI Analysis

This work addresses a gap in optimization methods for large-scale machine learning training, offering incremental improvements in flexibility for finite sum problems.

The authors tackled the lack of a general-purpose SAGA algorithm supporting arbitrary importance sampling and minibatching by proposing a flexible variant, establishing linear convergence rates under smooth and strongly convex conditions and a quadratic growth condition, matching the rates of the primal-dual method Quartz.

We study the problem of minimizing the average of a very large number of smooth functions, which is of key importance in training supervised learning models. One of the most celebrated methods in this context is the SAGA algorithm. Despite years of research on the topic, a general-purpose version of SAGA---one that would include arbitrary importance sampling and minibatching schemes---does not exist. We remedy this situation and propose a general and flexible variant of SAGA following the {\em arbitrary sampling} paradigm. We perform an iteration complexity analysis of the method, largely possible due to the construction of new stochastic Lyapunov functions. We establish linear convergence rates in the smooth and strongly convex regime, and under a quadratic functional growth condition (i.e., in a regime not assuming strong convexity). Our rates match those of the primal-dual method Quartz for which an arbitrary sampling analysis is available, which makes a significant step towards closing the gap in our understanding of complexity of primal and dual methods for finite sum problems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes