LGAIDCApr 21, 2024

FedTrans: Efficient Federated Learning via Multi-Model Transformation

arXiv:2404.13515v22 citationsh-index: 13MLSys
Originality Incremental advance
AI Analysis

This work addresses the problem of high training costs and suboptimal accuracy in federated learning for edge devices, offering a scalable solution that is incremental but with strong practical gains.

FedTrans tackles the challenge of training efficient and accurate models for heterogeneous clients in federated learning by introducing a multi-model framework that automatically generates and trains client-specific models, achieving 14%-72% accuracy improvements and 1.6X-20X cost reductions over state-of-the-art methods.

Federated learning (FL) aims to train machine learning (ML) models across potentially millions of edge client devices. Yet, training and customizing models for FL clients is notoriously challenging due to the heterogeneity of client data, device capabilities, and the massive scale of clients, making individualized model exploration prohibitively expensive. State-of-the-art FL solutions personalize a globally trained model or concurrently train multiple models, but they often incur suboptimal model accuracy and huge training costs. In this paper, we introduce FedTrans, a multi-model FL training framework that automatically produces and trains high-accuracy, hardware-compatible models for individual clients at scale. FedTrans begins with a basic global model, identifies accuracy bottlenecks in model architectures during training, and then employs model transformation to derive new models for heterogeneous clients on the fly. It judiciously assigns models to individual clients while performing soft aggregation on multi-model updates to minimize total training costs. Our evaluations using realistic settings show that FedTrans improves individual client model accuracy by 14% - 72% while slashing training costs by 1.6X - 20X over state-of-the-art solutions.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes