LG CV DCNov 10, 2022

Robust Federated Learning against both Data Heterogeneity and Poisoning Attack via Aggregation Optimization

Yueqi Xie, Weizhong Zhang, Renjie Pi, Fangzhao Wu, Qifeng Chen, Xing Xie, Sunghun Kim

arXiv:2211.05554v27.89 citationsh-index: 55

Originality Incremental advance

AI Analysis

This addresses robustness issues in real-world federated learning systems, offering a unified solution for heterogeneous and malicious clients, though it is incremental as it builds on existing aggregation techniques.

The paper tackles the dual challenges of non-IID data distribution and poisoning attacks in federated learning by proposing SmartFL, a method that optimizes server-side aggregation using a small proxy dataset, achieving state-of-the-art performance in both scenarios.

Non-IID data distribution across clients and poisoning attacks are two main challenges in real-world federated learning (FL) systems. While both of them have attracted great research interest with specific strategies developed, no known solution manages to address them in a unified framework. To universally overcome both challenges, we propose SmartFL, a generic approach that optimizes the server-side aggregation process with a small amount of proxy data collected by the service provider itself via a subspace training technique. Specifically, the aggregation weight of each participating client at each round is optimized using the server-collected proxy data, which is essentially the optimization of the global model in the convex hull spanned by client models. Since at each round, the number of tunable parameters optimized on the server side equals the number of participating clients (thus independent of the model size), we are able to train a global model with massive parameters using only a small amount of proxy data (e.g., around one hundred samples). With optimized aggregation, SmartFL ensures robustness against both heterogeneous and malicious clients, which is desirable in real-world FL where either or both problems may occur. We provide theoretical analyses of the convergence and generalization capacity for SmartFL. Empirically, SmartFL achieves state-of-the-art performance on both FL with non-IID data distribution and FL with malicious clients. The source code will be released.

View on arXiv PDF

Similar