LGOCMar 28, 2023

Concentration of Contractive Stochastic Approximation: Additive and Multiplicative Noise

arXiv:2303.15740v228 citationsh-index: 27
Originality Incremental advance
AI Analysis

This work provides theoretical guarantees for stochastic approximation methods, which are incremental as it refines existing concentration bounds for specific noise types in optimization and reinforcement learning.

The paper establishes maximal concentration bounds for stochastic approximation algorithms under contractive operators, showing that convergence error has a sub-Gaussian tail for additive noise and a Weibull tail for multiplicative noise, with an impossibility result for sub-exponential tails under multiplicative noise.

In this paper, we establish maximal concentration bounds for the iterates generated by a stochastic approximation (SA) algorithm under a contractive operator with respect to some arbitrary norm (for example, the $\ell_\infty$-norm). We consider two settings where the iterates are potentially unbounded: SA with bounded multiplicative noise and SA with sub-Gaussian additive noise. Our maximal concentration inequalities state that the convergence error has a sub-Gaussian tail in the additive noise setting and a Weibull tail (which is faster than polynomial decay but could be slower than exponential decay) in the multiplicative noise setting. In addition, we provide an impossibility result showing that it is generally impossible to have sub-exponential tails under multiplicative noise. To establish the maximal concentration bounds, we develop a novel bootstrapping argument that involves bounding the moment-generating function of a modified version of the generalized Moreau envelope of the convergence error and constructing an exponential supermartingale to enable using Ville's maximal inequality. We demonstrate the applicability of our theoretical results in the context of linear SA and reinforcement learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes