LGSYOCMLApr 27, 2025

$O(1/k)$ Finite-Time Bound for Non-Linear Two-Time-Scale Stochastic Approximation

arXiv:2504.19375v18 citationsh-index: 5
Originality Highly original
AI Analysis

This work provides a theoretical advancement for researchers in reinforcement learning, optimization, and game control by addressing a bottleneck in convergence analysis for non-linear iterative methods.

The paper tackles the problem of improving convergence bounds for non-linear two-time-scale stochastic approximation algorithms, achieving an O(1/k) finite-time bound, which matches the best known rate for linear cases and improves upon the previous O(1/k^{2/3}) bound.

Two-time-scale stochastic approximation is an algorithm with coupled iterations which has found broad applications in reinforcement learning, optimization and game control. While several prior works have obtained a mean square error bound of $O(1/k)$ for linear two-time-scale iterations, the best known bound in the non-linear contractive setting has been $O(1/k^{2/3})$. In this work, we obtain an improved bound of $O(1/k)$ for non-linear two-time-scale stochastic approximation. Our result applies to algorithms such as gradient descent-ascent and two-time-scale Lagrangian optimization. The key step in our analysis involves rewriting the original iteration in terms of an averaged noise sequence which decays sufficiently fast. Additionally, we use an induction-based approach to show that the iterates are bounded in expectation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes