LGJan 26, 2022

Fast Server Learning Rate Tuning for Coded Federated Dropout

Giacomo Verardo, Daniel Barreira, Marco Chiesa, Dejan Kostic, Gerald Q. Maguire

arXiv:2201.11036v41.8h-index: 36

Originality Incremental advance

AI Analysis

This work addresses communication efficiency and accuracy issues in Federated Learning for resource-constrained clients, representing an incremental improvement over existing Federated Dropout methods.

The paper tackles the problem of low accuracy and slow convergence in Federated Dropout for cross-device Federated Learning by introducing a coding theory-based enhancement and server learning rate tuning, achieving 99.6% of the final accuracy of the no dropout case with 2.43× less bandwidth on the EMNIST dataset.

In cross-device Federated Learning (FL), clients with low computational power train a common\linebreak[4] machine model by exchanging parameters via updates instead of potentially private data. Federated Dropout (FD) is a technique that improves the communication efficiency of a FL session by selecting a \emph{subset} of model parameters to be updated in each training round. However, compared to standard FL, FD produces considerably lower accuracy and faces a longer convergence time. In this paper, we leverage \textit{coding theory} to enhance FD by allowing different sub-models to be used at each client. We also show that by carefully tuning the server learning rate hyper-parameter, we can achieve higher training speed while also achieving up to the same final accuracy as the no dropout case. For the EMNIST dataset, our mechanism achieves 99.6\% of the final accuracy of the no dropout case while requiring $2.43\times$ less bandwidth to achieve this level of accuracy.

View on arXiv PDF

Similar