LGOct 19, 2021

Layer-wise Adaptive Model Aggregation for Scalable Federated Learning

arXiv:2110.10302v388 citations
Originality Synthesis-oriented
AI Analysis

This work addresses communication inefficiency in Federated Learning, which is a domain-specific incremental improvement for scalable distributed training.

The paper tackled the problem of inefficient network bandwidth consumption in Federated Learning by proposing FedLAMA, a layer-wise adaptive model aggregation scheme that reduces communication cost by up to 60-70% while maintaining comparable accuracy to FedAvg.

In Federated Learning, a common approach for aggregating local models across clients is periodic averaging of the full model parameters. It is, however, known that different layers of neural networks can have a different degree of model discrepancy across the clients. The conventional full aggregation scheme does not consider such a difference and synchronizes the whole model parameters at once, resulting in inefficient network bandwidth consumption. Aggregating the parameters that are similar across the clients does not make meaningful training progress while increasing the communication cost. We propose FedLAMA, a layer-wise model aggregation scheme for scalable Federated Learning. FedLAMA adaptively adjusts the aggregation interval in a layer-wise manner, jointly considering the model discrepancy and the communication cost. The layer-wise aggregation method enables to finely control the aggregation interval to relax the aggregation frequency without a significant impact on the model accuracy. Our empirical study shows that FedLAMA reduces the communication cost by up to 60% for IID data and 70% for non-IID data while achieving a comparable accuracy to FedAvg.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes