MLLGOCDec 31, 2019

Stochastic Recursive Variance Reduction for Efficient Smooth Non-Convex Compositional Optimization

arXiv:1912.13515v214 citations
Originality Incremental advance
AI Analysis

This addresses efficient optimization for tasks like reinforcement learning value function evaluation, but it is incremental as it builds on existing stochastic recursive gradient descent ideas.

The paper tackles stochastic compositional optimization in smooth non-convex settings, proposing the SARAH-Compositional algorithm and proving an IFO complexity upper bound of O((n+m)^{1/2} ε^{-2}) for finite-sum cases and O(ε^{-3}) for online cases, which is claimed to be optimal.

Stochastic compositional optimization arises in many important machine learning tasks such as value function evaluation in reinforcement learning and portfolio management. The objective function is the composition of two expectations of stochastic functions, and is more challenging to optimize than vanilla stochastic optimization problems. In this paper, we investigate the stochastic compositional optimization in the general smooth non-convex setting. We employ a recently developed idea of \textit{Stochastic Recursive Gradient Descent} to design a novel algorithm named SARAH-Compositional, and prove a sharp Incremental First-order Oracle (IFO) complexity upper bound for stochastic compositional optimization: $\mathcal{O}((n+m)^{1/2} \varepsilon^{-2})$ in the finite-sum case and $\mathcal{O}(\varepsilon^{-3})$ in the online case. Such a complexity is known to be the best one among IFO complexity results for non-convex stochastic compositional optimization, and is believed to be optimal. Our experiments validate the theoretical performance of our algorithm.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes