LGAICVApr 8

SubFLOT: Submodel Extraction for Efficient and Personalized Federated Learning via Optimal Transport

arXiv:2604.0663139.11 citationsh-index: 2
Predicted impact top 63% in LG · last 90 daysOriginality Highly original
AI Analysis

This addresses the problem of system and statistical heterogeneity in federated learning for resource-constrained edge devices, offering a novel solution that balances personalization and efficiency.

The paper tackles the challenge of efficient and personalized federated learning by proposing SubFLOT, which uses optimal transport to generate customized submodels without raw data access and adaptive regularization to stabilize training, achieving substantial performance gains over state-of-the-art methods.

Federated Learning (FL) enables collaborative model training while preserving data privacy, but its practical deployment is hampered by system and statistical heterogeneity. While federated network pruning offers a path to mitigate these issues, existing methods face a critical dilemma: server-side pruning lacks personalization, whereas client-side pruning is computationally prohibitive for resource-constrained devices. Furthermore, the pruning process itself induces significant parametric divergence among heterogeneous submodels, destabilizing training and hindering global convergence. To address these challenges, we propose SubFLOT, a novel framework for server-side personalized federated pruning. SubFLOT introduces an Optimal Transport-enhanced Pruning (OTP) module that treats historical client models as proxies for local data distributions, formulating the pruning task as a Wasserstein distance minimization problem to generate customized submodels without accessing raw data. Concurrently, to counteract parametric divergence, our Scaling-based Adaptive Regularization (SAR) module adaptively penalizes a submodel's deviation from the global model, with the penalty's strength scaled by the client's pruning rate. Comprehensive experiments demonstrate that SubFLOT consistently and substantially outperforms state-of-the-art methods, underscoring its potential for deploying efficient and personalized models on resource-constrained edge devices.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes