LGCVDec 18, 2024

Covariances for Free: Exploiting Mean Distributions for Training-free Federated Learning

arXiv:2412.14326v32 citationsh-index: 36Has Code
Originality Incremental advance
AI Analysis

This work addresses communication efficiency in federated learning for distributed systems, offering an incremental improvement over existing training-free methods.

The paper tackles the problem of communication overhead in federated learning by proposing a training-free method that uses only class means to estimate class covariances for initializing the global classifier, achieving performance improvements of 4-26% with the same communication cost and outperforming methods with higher overhead.

Using pre-trained models has been found to reduce the effect of data heterogeneity and speed up federated learning algorithms. Recent works have explored training-free methods using first- and second-order statistics to aggregate local client data distributions at the server and achieve high performance without any training. In this work, we propose a training-free method based on an unbiased estimator of class covariance matrices which only uses first-order statistics in the form of class means communicated by clients to the server. We show how these estimated class covariances can be used to initialize the global classifier, thus exploiting the covariances without actually sharing them. We also show that using only within-class covariances results in a better classifier initialization. Our approach improves performance in the range of 4-26% with exactly the same communication cost when compared to methods sharing only class means and achieves performance competitive or superior to methods sharing second-order statistics with dramatically less communication overhead. The proposed method is much more communication-efficient than federated prompt-tuning methods and still outperforms them. Finally, using our method to initialize classifiers and then performing federated fine-tuning or linear probing again yields better performance. Code is available at https://github.com/dipamgoswami/FedCOF.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes