OC LG MLMar 4, 2025

Enhancing Distributional Robustness in Principal Component Analysis by Wasserstein Distances

arXiv:2503.02494v2h-index: 6

AI Analysis

This work addresses uncertainty in probability distributions for PCA, which is incremental as it applies DRO to a classical method.

The paper tackles distributional robustness in PCA by formulating a min-max optimization problem using Wasserstein distances, and it develops a smoothing manifold proximal gradient algorithm with proven convergence and O(ε^{-3}) iteration complexity.

We consider the distributionally robust optimization (DRO) model of principal component analysis (PCA) to account for uncertainty in the underlying probability distribution. The resulting formulation leads to a nonsmooth constrained min-max optimization problem, where the ambiguity set captures the distributional uncertainty by the type-$2$ Wasserstein distance. We prove that the inner maximization problem admits a closed-form optimal value. This explicit characterization equivalently reformulates the original DRO model into a minimization problem on the Stiefel manifold with intricate nonsmooth terms, a challenging formulation beyond the reach of existing algorithms. To address this issue, we devise an efficient smoothing manifold proximal gradient algorithm. Our analysis establishes Riemannian gradient consistency and global convergence of our algorithm to a stationary point of the nonsmooth minimization problem. We also provide the iteration complexity $O(ε^{-3})$ of our algorithm to achieve an $ε$-approximate stationary point. Finally, numerical experiments are conducted to validate the effectiveness and scalability of our algorithm, as well as to highlight the necessity and rationality of adopting the DRO model for PCA.

View on arXiv PDF

Similar