LG DCMay 28, 2025

Hybrid Batch Normalisation: Resolving the Dilemma of Batch Normalisation in Federated Learning

Hongyao Chen, Tianyang Xu, Xiaojun Wu, Josef Kittler

arXiv:2505.21877v14.1h-index: 17Has CodeICML

Originality Incremental advance

AI Analysis

This addresses a critical bottleneck in federated learning for distributed systems, offering an incremental improvement over existing normalization methods.

The paper tackles the performance degradation of Batch Normalization in federated learning due to non-IID data by proposing Hybrid Batch Normalization (HBN), which separates statistical and learnable parameter updates and adaptively mixes local and global statistics, achieving improved performance across various settings, especially for small batch sizes and heterogeneous data.

Batch Normalisation (BN) is widely used in conventional deep neural network training to harmonise the input-output distributions for each batch of data. However, federated learning, a distributed learning paradigm, faces the challenge of dealing with non-independent and identically distributed data among the client nodes. Due to the lack of a coherent methodology for updating BN statistical parameters, standard BN degrades the federated learning performance. To this end, it is urgent to explore an alternative normalisation solution for federated learning. In this work, we resolve the dilemma of the BN layer in federated learning by developing a customised normalisation approach, Hybrid Batch Normalisation (HBN). HBN separates the update of statistical parameters (i.e. , means and variances used for evaluation) from that of learnable parameters (i.e. , parameters that require gradient updates), obtaining unbiased estimates of global statistical parameters in distributed scenarios. In contrast with the existing solutions, we emphasise the supportive power of global statistics for federated learning. The HBN layer introduces a learnable hybrid distribution factor, allowing each computing node to adaptively mix the statistical parameters of the current batch with the global statistics. Our HBN can serve as a powerful plugin to advance federated learning performance. It reflects promising merits across a wide range of federated learning settings, especially for small batch sizes and heterogeneous data.

View on arXiv PDF Code

Similar