LGOCOct 8, 2023

Asymmetrically Decentralized Federated Learning

arXiv:2310.05093v18 citationsh-index: 12
Originality Incremental advance
AI Analysis

This work addresses communication and privacy problems in federated learning for distributed machine learning systems, representing an incremental improvement over existing decentralized methods.

The paper tackles the issues of deadlocks and network sensitivity in decentralized federated learning by proposing the DFedSGPSM algorithm, which uses asymmetric topologies and the Push-Sum protocol, achieving a convergence rate of O(1/√T) and demonstrating superior performance on MNIST, CIFAR10, and CIFAR100 datasets compared to state-of-the-art optimizers.

To address the communication burden and privacy concerns associated with the centralized server in Federated Learning (FL), Decentralized Federated Learning (DFL) has emerged, which discards the server with a peer-to-peer (P2P) communication framework. However, most existing DFL algorithms are based on symmetric topologies, such as ring and grid topologies, which can easily lead to deadlocks and are susceptible to the impact of network link quality in practice. To address these issues, this paper proposes the DFedSGPSM algorithm, which is based on asymmetric topologies and utilizes the Push-Sum protocol to effectively solve consensus optimization problems. To further improve algorithm performance and alleviate local heterogeneous overfitting in Federated Learning (FL), our algorithm combines the Sharpness Aware Minimization (SAM) optimizer and local momentum. The SAM optimizer employs gradient perturbations to generate locally flat models and searches for models with uniformly low loss values, mitigating local heterogeneous overfitting. The local momentum accelerates the optimization process of the SAM optimizer. Theoretical analysis proves that DFedSGPSM achieves a convergence rate of $\mathcal{O}(\frac{1}{\sqrt{T}})$ in a non-convex smooth setting under mild assumptions. This analysis also reveals that better topological connectivity achieves tighter upper bounds. Empirically, extensive experiments are conducted on the MNIST, CIFAR10, and CIFAR100 datasets, demonstrating the superior performance of our algorithm compared to state-of-the-art optimizers.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes