OC LGSep 18, 2023

Distributionally Time-Varying Online Stochastic Optimization under Polyak-Łojasiewicz Condition with Application in Conditional Value-at-Risk Statistical Learning

Yuen-Man Pun, Farhad Farokhi, Iman Shames

arXiv:2309.09411v17.64 citationsh-index: 34

Originality Incremental advance

AI Analysis

This work addresses incremental improvements in optimization theory for machine learning applications, specifically in risk-aware learning scenarios.

The authors tackled the problem of online stochastic optimization under time-varying distributions by establishing dynamic regret bounds for online stochastic gradient descent under the Polyak-Łojasiewicz condition, with applications to Conditional Value-at-Risk learning, resulting in improved regret bounds.

In this work, we consider a sequence of stochastic optimization problems following a time-varying distribution via the lens of online optimization. Assuming that the loss function satisfies the Polyak-Łojasiewicz condition, we apply online stochastic gradient descent and establish its dynamic regret bound that is composed of cumulative distribution drifts and cumulative gradient biases caused by stochasticity. The distribution metric we adopt here is Wasserstein distance, which is well-defined without the absolute continuity assumption or with a time-varying support set. We also establish a regret bound of online stochastic proximal gradient descent when the objective function is regularized. Moreover, we show that the above framework can be applied to the Conditional Value-at-Risk (CVaR) learning problem. Particularly, we improve an existing proof on the discovery of the PL condition of the CVaR problem, resulting in a regret bound of online stochastic gradient descent.

View on arXiv PDF

Similar