LGAIMay 11

Provable Sparse Inversion and Token Relabel Enhanced One-shot Federated Learning with ViTs

arXiv:2605.1074853.3
Predicted impact top 46% in LG · last 90 daysOriginality Incremental advance
AI Analysis

Improves one-shot federated learning for heterogeneous clients, a practical but challenging scenario.

FedMITR addresses semantic misalignment in one-shot federated learning under non-IID settings by using sparse model inversion to focus on foreground patches and token relabeling to handle low-information patches, achieving tighter generalization bounds and outperforming baselines.

One-Shot Federated Learning, where a central server learns a global model in a single communication round, has emerged as a promising paradigm. However, under extremely non-IID settings, existing data-free methods often generate low-quality data that suffers from severe semantic misalignment with ground-truth labels. To overcome these issues, we propose a novel Federated Model Inversion and Token Relabel (FedMITR) framework, which trains the global model by fully exploiting all patches of synthetic images. Specifically, FedMITR employs sparse model inversion during data generation, selectively inverting semantic foregrounds while halting the inversion of uninformative backgrounds. To address semantically meaningless tokens that hinder ViT predictions, we implement a differentiated strategy: patches with high information density utilize generated pseudo-labels, while patches with low information density are relabeled via ensemble models for robust distillation. Theoretically, our analysis based on algorithmic stability reveals that Sparse Model Inversion eliminates gradient instability arising from background noise, while Token Relabel effectively reduces gradient variance, collectively guaranteeing a tighter generalization bound. Empirically, extensive experimental results demonstrate that FedMITR substantially outperforms existing baselines under various settings.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes