MLLGDec 11, 2016

Lock-Free Optimization for Non-Convex Problems

arXiv:1612.03441v16 citations
Originality Incremental advance
AI Analysis

This addresses a theoretical gap for researchers and practitioners using parallel optimization in machine learning, though it is incremental as it extends existing methods to non-convex settings.

The paper tackles the lack of convergence proofs for lock-free parallel stochastic gradient descent methods in non-convex optimization, providing theoretical proofs and empirical verification for Hogwild! and AsySVRG, showing they converge on non-convex problems.

Stochastic gradient descent~(SGD) and its variants have attracted much attention in machine learning due to their efficiency and effectiveness for optimization. To handle large-scale problems, researchers have recently proposed several lock-free strategy based parallel SGD~(LF-PSGD) methods for multi-core systems. However, existing works have only proved the convergence of these LF-PSGD methods for convex problems. To the best of our knowledge, no work has proved the convergence of the LF-PSGD methods for non-convex problems. In this paper, we provide the theoretical proof about the convergence of two representative LF-PSGD methods, Hogwild! and AsySVRG, for non-convex problems. Empirical results also show that both Hogwild! and AsySVRG are convergent on non-convex problems, which successfully verifies our theoretical results.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes