OC LG MLSep 6, 2018

Stochastically Controlled Stochastic Gradient for the Convex and Non-convex Composition problem

Liu Liu, Ji Liu, Cho-Jui Hsieh, Dacheng Tao

arXiv:1809.02505v116.613 citations

Originality Incremental advance

AI Analysis

This addresses a computational bottleneck in large-scale composition optimization for machine learning practitioners, representing an incremental improvement.

The paper tackles the convex and non-convex composition optimization problem by proposing a stochastically controlled stochastic gradient (SCSG) method to estimate gradients efficiently, achieving query complexity equal to or better than current methods.

In this paper, we consider the convex and non-convex composition problem with the structure $\frac{1}{n}\sum\nolimits_{i = 1}^n {{F_i}( {G( x )} )}$, where $G( x )=\frac{1}{n}\sum\nolimits_{j = 1}^n {{G_j}( x )} $ is the inner function, and $F_i(\cdot)$ is the outer function. We explore the variance reduction based method to solve the composition optimization. Due to the fact that when the number of inner function and outer function are large, it is not reasonable to estimate them directly, thus we apply the stochastically controlled stochastic gradient (SCSG) method to estimate the gradient of the composition function and the value of the inner function. The query complexity of our proposed method for the convex and non-convex problem is equal to or better than the current method for the composition problem. Furthermore, we also present the mini-batch version of the proposed method, which has the improved the query complexity with related to the size of the mini-batch.

View on arXiv PDF

Similar