DCAILGDec 23, 2023

Efficient Asynchronous Federated Learning with Sparsification and Quantization

arXiv:2312.15186v220 citationsConcurr Comput
Originality Incremental advance
AI Analysis

This addresses efficiency issues in federated learning for edge computing, but it is incremental as it builds on existing methods with optimizations.

The paper tackles the problem of straggler devices and communication bottlenecks in federated learning by proposing TEASQ-Fed, which uses asynchronous training, sparsification, and quantization to improve accuracy by up to 16.67% and accelerate convergence by up to twice as fast.

While data is distributed in multiple edge devices, Federated Learning (FL) is attracting more and more attention to collaboratively train a machine learning model without transferring raw data. FL generally exploits a parameter server and a large number of edge devices during the whole process of the model training, while several devices are selected in each round. However, straggler devices may slow down the training process or even make the system crash during training. Meanwhile, other idle edge devices remain unused. As the bandwidth between the devices and the server is relatively low, the communication of intermediate data becomes a bottleneck. In this paper, we propose Time-Efficient Asynchronous federated learning with Sparsification and Quantization, i.e., TEASQ-Fed. TEASQ-Fed can fully exploit edge devices to asynchronously participate in the training process by actively applying for tasks. We utilize control parameters to choose an appropriate number of parallel edge devices, which simultaneously execute the training tasks. In addition, we introduce a caching mechanism and weighted averaging with respect to model staleness to further improve the accuracy. Furthermore, we propose a sparsification and quantitation approach to compress the intermediate data to accelerate the training. The experimental results reveal that TEASQ-Fed improves the accuracy (up to 16.67% higher) while accelerating the convergence of model training (up to twice faster).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes