LGOct 19, 2021

Layer-wise Adaptive Model Aggregation for Scalable Federated Learning

Sunwoo Lee, Tuo Zhang, Chaoyang He, Salman Avestimehr

arXiv:2110.10302v319.988 citations

Originality Synthesis-oriented

AI Analysis

This work addresses communication inefficiency in Federated Learning, which is a domain-specific incremental improvement for scalable distributed training.

The paper tackled the problem of inefficient network bandwidth consumption in Federated Learning by proposing FedLAMA, a layer-wise adaptive model aggregation scheme that reduces communication cost by up to 60-70% while maintaining comparable accuracy to FedAvg.

In Federated Learning, a common approach for aggregating local models across clients is periodic averaging of the full model parameters. It is, however, known that different layers of neural networks can have a different degree of model discrepancy across the clients. The conventional full aggregation scheme does not consider such a difference and synchronizes the whole model parameters at once, resulting in inefficient network bandwidth consumption. Aggregating the parameters that are similar across the clients does not make meaningful training progress while increasing the communication cost. We propose FedLAMA, a layer-wise model aggregation scheme for scalable Federated Learning. FedLAMA adaptively adjusts the aggregation interval in a layer-wise manner, jointly considering the model discrepancy and the communication cost. The layer-wise aggregation method enables to finely control the aggregation interval to relax the aggregation frequency without a significant impact on the model accuracy. Our empirical study shows that FedLAMA reduces the communication cost by up to 60% for IID data and 70% for non-IID data while achieving a comparable accuracy to FedAvg.

View on arXiv PDF

Similar