LGAIDCApr 3

FedSQ: Optimized Weight Averaging via Fixed Gating

arXiv:2604.029904.6h-index: 4
Predicted impact top 87% in LG · last 90 daysOriginality Incremental advance
AI Analysis

This addresses instability in federated learning for cross-silo deployments, but it is incremental as it builds on existing transfer-initialized methods.

The paper tackled the problem of instability in federated learning due to statistical heterogeneity by proposing FedSQ, which freezes structural gating masks from a pretrained model and only optimizes quantitative parameters, reducing rounds-to-best validation performance while preserving accuracy.

Federated learning (FL) enables collaborative training across organizations without sharing raw data, but it is hindered by statistical heterogeneity (non-i.i.d.\ client data) and by instability of naive weight averaging under client drift. In many cross-silo deployments, FL is warm-started from a strong pretrained backbone (e.g., ImageNet-1K) and then adapted to local domains. Motivated by recent evidence that ReLU-like gating regimes (structural knowledge) stabilize earlier than the remaining parameter values (quantitative knowledge), we propose FedSQ (Federated Structural-Quantitative learning), a transfer-initialized neural federated procedure based on a DualCopy, piecewise-linear view of deep networks. FedSQ freezes a structural copy of the pretrained model to induce fixed binary gating masks during federated fine-tuning, while only a quantitative copy is optimized locally and aggregated across rounds. Fixing the gating reduces learning to within-regime affine refinements, which stabilizes aggregation under heterogeneous partitions. Experiments on two convolutional neural network backbones under i.i.d.\ and Dirichlet splits show that FedSQ improves robustness and can reduce rounds-to-best validation performance relative to standard baselines while preserving accuracy in the transfer setting.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes