OC CC LG NA MLMar 24, 2025

High Probability Complexity Bounds of Trust-Region Stochastic Sequential Quadratic Programming with Heavy-Tailed Noise

arXiv:2503.19091v213.05 citationsh-index: 37

Originality Incremental advance

AI Analysis

This work addresses optimization under heavy-tailed noise, extending analysis to second-order stationarity, which is incremental compared to prior studies focused on light-tailed noise and first-order methods.

The paper tackles nonlinear optimization with stochastic objectives and deterministic constraints by proposing a Trust-Region Stochastic Sequential Quadratic Programming (TR-SSQP) method, establishing high-probability iteration complexity bounds of O(ε^{-2}) for first-order and O(ε^{-3}) for second-order ε-stationary points under heavy-tailed noise conditions.

In this paper, we consider nonlinear optimization problems with a stochastic objective and deterministic equality constraints. We propose a Trust-Region Stochastic Sequential Quadratic Programming (TR-SSQP) method and establish its high-probability iteration complexity bounds for identifying first- and second-order $ε$-stationary points. In our algorithm, we assume that exact objective values, gradients, and Hessians are not directly accessible but can be estimated via zeroth-, first-, and second-order probabilistic oracles. Compared to existing complexity studies of SSQP methods that rely on a zeroth-order oracle with sub-exponential tail noise (i.e., light-tailed) and focus mostly on first-order stationarity, our analysis accommodates irreducible and heavy-tailed noise in the zeroth-order oracle and significantly extends the analysis to second-order stationarity. We show that under heavy-tailed noise conditions, our SSQP method achieves the same high-probability first-order iteration complexity bounds as in the light-tailed noise setting, while further exhibiting promising second-order iteration complexity bounds. Specifically, the method identifies a first-order $ε$-stationary point in $\mathcal{O}(ε^{-2})$ iterations and a second-order $ε$-stationary point in $\mathcal{O}(ε^{-3})$ iterations with high probability, provided that $ε$ is lower bounded by a constant determined by the irreducible noise level in estimation. We validate our theoretical findings and evaluate the practical performance of our method on CUTEst benchmark test set.

View on arXiv PDF

Similar