LG OCSep 29, 2025

Distributionally Robust Federated Learning with Outlier Resilience

Zifan Wang, Xinlei Yi, Xenia Konti, Michael M. Zavlanos, Karl H. Johansson

arXiv:2509.24462v14.1h-index: 40

Originality Incremental advance

AI Analysis

This work addresses the challenge of outlier resilience in federated learning for applications like healthcare or finance, though it is incremental as it builds on existing distributionally robust optimization frameworks.

The paper tackles the problem of performance degradation in federated learning due to data distribution perturbations and outliers by introducing a distributionally robust optimization method with explicit outlier resilience, achieving improved robustness in experiments on synthetic and real-world datasets.

Federated learning (FL) enables collaborative model training without direct data sharing, but its performance can degrade significantly in the presence of data distribution perturbations. Distributionally robust optimization (DRO) provides a principled framework for handling this by optimizing performance against the worst-case distributions within a prescribed ambiguity set. However, existing DRO-based FL methods often overlook the detrimental impact of outliers in local datasets, which can disproportionately bias the learned models. In this work, we study distributionally robust federated learning with explicit outlier resilience. We introduce a novel ambiguity set based on the unbalanced Wasserstein distance, which jointly captures geometric distributional shifts and incorporates a non-geometric Kullback--Leibler penalization to mitigate the influence of outliers. This formulation naturally leads to a challenging min--max--max optimization problem. To enable decentralized training, we reformulate the problem as a tractable Lagrangian penalty optimization, which admits robustness certificates. Building on this reformulation, we propose the distributionally outlier-robust federated learning algorithm and establish its convergence guarantees. Extensive experiments on both synthetic and real-world datasets demonstrate the effectiveness of our approach.

View on arXiv PDF

Similar