OCLGApr 7

Stochastic Auto-conditioned Fast Gradient Methods with Optimal Rates

arXiv:2604.0652542.6h-index: 2
Predicted impact top 26% in OC · last 90 daysOriginality Highly original
AI Analysis

This work addresses a central problem in optimization theory for researchers and practitioners by providing a parameter-free method that overcomes limitations of existing stochastic approaches.

The paper tackles the challenge of achieving optimal rates for stochastic composite convex optimization without prior knowledge of problem parameters, proposing a stochastic variant of the auto-conditioned fast gradient method that attains optimal iteration complexity of O(1/√ε) and sample complexity of O(1/ε²).

Achieving optimal rates for stochastic composite convex optimization without prior knowledge of problem parameters remains a central challenge. In the deterministic setting, the auto-conditioned fast gradient method has recently been proposed to attain optimal accelerated rates without line-search procedures or prior knowledge of the Lipschitz smoothness constant, providing a natural prototype for parameter-free acceleration. However, extending this approach to the stochastic setting has proven technically challenging and remains open. Existing parameter-free stochastic methods either fail to achieve accelerated rates or rely on restrictive assumptions, such as bounded domains, bounded gradients, prior knowledge of the iteration horizon, or strictly sub-Gaussian noise. To address these limitations, we propose a stochastic variant of the auto-conditioned fast gradient method, referred to as stochastic AC-FGM. The proposed method is fully adaptive to the Lipschitz constant, the iteration horizon, and the noise level, enabling both adaptive stepsize selection and adaptive mini-batch sizing without line-search procedures. Under standard bounded conditional variance assumptions, we show that stochastic AC-FGM achieves the optimal iteration complexity of $O(1/\sqrt{\varepsilon})$ and the optimal sample complexity of $O(1/\varepsilon^2)$.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes