DCLGFeb 23, 2024

Convergence Analysis of Split Federated Learning on Heterogeneous Data

arXiv:2402.15166v326 citationsh-index: 4NIPS
Originality Incremental advance
AI Analysis

It addresses a theoretical gap for researchers in distributed machine learning, but is incremental as it extends existing analysis to SFL.

This paper tackles the lack of convergence analysis for split federated learning (SFL) by providing theoretical convergence rates for strongly convex, general convex, and non-convex objectives on heterogeneous data, with rates of O(1/T) and O(1/∛T), and shows SFL outperforms FL and split learning in experiments with highly heterogeneous data.

Split federated learning (SFL) is a recent distributed approach for collaborative model training among multiple clients. In SFL, a global model is typically split into two parts, where clients train one part in a parallel federated manner, and a main server trains the other. Despite the recent research on SFL algorithm development, the convergence analysis of SFL is missing in the literature, and this paper aims to fill this gap. The analysis of SFL can be more challenging than that of federated learning (FL), due to the potential dual-paced updates at the clients and the main server. We provide convergence analysis of SFL for strongly convex and general convex objectives on heterogeneous data. The convergence rates are $O(1/T)$ and $O(1/\sqrt[3]{T})$, respectively, where $T$ denotes the total number of rounds for SFL training. We further extend the analysis to non-convex objectives and the scenario where some clients may be unavailable during training. Experimental experiments validate our theoretical results and show that SFL outperforms FL and split learning (SL) when data is highly heterogeneous across a large number of clients.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes