LG DCMar 6

First-Order Softmax Weighted Switching Gradient Method for Distributed Stochastic Minimax Optimization with Stochastic Constraints

Zhankun Luo, Antesh Upadhyay, Sang Bin Moon, Abolfazl Hashemi

arXiv:2603.05774v17.5h-index: 4

Predicted impact top 58% in LG · last 90 daysOriginality Incremental advance

AI Analysis

This addresses the problem of optimizing worst-case client performance in federated learning with constraints, offering a stable alternative to traditional methods, though it is incremental as it builds on existing distributed optimization frameworks.

The paper tackles distributed stochastic minimax optimization with stochastic constraints in federated learning, proposing a first-order Softmax-Weighted Switching Gradient method that achieves O(ε⁻⁴) oracle complexity for optimality and feasibility under full client participation, with high-probability convergence guarantees and experimental validation on classification tasks.

This paper addresses the distributed stochastic minimax optimization problem subject to stochastic constraints. We propose a novel first-order Softmax-Weighted Switching Gradient method tailored for federated learning. Under full client participation, our algorithm achieves the standard $\mathcal{O}(ε^{-4})$ oracle complexity to satisfy a unified bound $ε$ for both the optimality gap and feasibility tolerance. We extend our theoretical analysis to the practical partial participation regime by quantifying client sampling noise through a stochastic superiority assumption. Furthermore, by relaxing standard boundedness assumptions on the objective functions, we establish a strictly tighter lower bound for the softmax hyperparameter. We provide a unified error decomposition and establish a sharp $\mathcal{O}(\log\frac{1}δ)$ high-probability convergence guarantee. Ultimately, our framework demonstrates that a single-loop primal-only switching mechanism provides a stable alternative for optimizing worst-case client performance, effectively bypassing the hyperparameter sensitivity and convergence oscillations often encountered in traditional primal-dual or penalty-based approaches. We verify the efficacy of our algorithm via experiment on the Neyman-Pearson (NP) classification and fair classification tasks.

View on arXiv PDF

Similar