OCLGMLSep 6, 2018

Stochastically Controlled Stochastic Gradient for the Convex and Non-convex Composition problem

arXiv:1809.02505v113 citations
Originality Incremental advance
AI Analysis

This addresses a computational bottleneck in large-scale composition optimization for machine learning practitioners, representing an incremental improvement.

The paper tackles the convex and non-convex composition optimization problem by proposing a stochastically controlled stochastic gradient (SCSG) method to estimate gradients efficiently, achieving query complexity equal to or better than current methods.

In this paper, we consider the convex and non-convex composition problem with the structure $\frac{1}{n}\sum\nolimits_{i = 1}^n {{F_i}( {G( x )} )}$, where $G( x )=\frac{1}{n}\sum\nolimits_{j = 1}^n {{G_j}( x )} $ is the inner function, and $F_i(\cdot)$ is the outer function. We explore the variance reduction based method to solve the composition optimization. Due to the fact that when the number of inner function and outer function are large, it is not reasonable to estimate them directly, thus we apply the stochastically controlled stochastic gradient (SCSG) method to estimate the gradient of the composition function and the value of the inner function. The query complexity of our proposed method for the convex and non-convex problem is equal to or better than the current method for the composition problem. Furthermore, we also present the mini-batch version of the proposed method, which has the improved the query complexity with related to the size of the mini-batch.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes