OCMLJul 25, 2016

Accelerating Stochastic Composition Optimization

arXiv:1607.07329v1161 citations
Originality Incremental advance
AI Analysis

This work addresses optimization challenges in machine learning, particularly for problems with nonsmooth regularization, offering incremental improvements in efficiency for applications like reinforcement learning.

The paper tackles the stochastic composition optimization problem by proposing the accelerated stochastic compositional proximal gradient (ASC-PG) method, which achieves faster convergence than existing algorithms and optimal sample-error complexity in key cases.

Consider the stochastic composition optimization problem where the objective is a composition of two expected-value functions. We propose a new stochastic first-order method, namely the accelerated stochastic compositional proximal gradient (ASC-PG) method, which updates based on queries to the sampling oracle using two different timescales. The ASC-PG is the first proximal gradient method for the stochastic composition problem that can deal with nonsmooth regularization penalty. We show that the ASC-PG exhibits faster convergence than the best known algorithms, and that it achieves the optimal sample-error complexity in several important special cases. We further demonstrate the application of ASC-PG to reinforcement learning and conduct numerical experiments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes