CVAug 14, 2025

Generalizable Federated Learning using Client Adaptive Focal Modulation

arXiv:2508.10840v1h-index: 6Has Code
Originality Incremental advance
AI Analysis

This work addresses the problem of non-IID and cross-domain data in federated learning for privacy-preserving collaborative training, representing an incremental advancement over prior methods.

The paper tackles the challenge of improving generalization and scalability in federated learning by proposing AdaptFED, which refines focal modulation with task-aware client embeddings and reduces communication overhead, achieving superior performance over state-of-the-art baselines on eight diverse datasets.

Federated learning (FL) has proven essential for privacy-preserving, collaborative training across distributed clients. Our prior work, TransFed, introduced a robust transformer-based FL framework that leverages a learn-to-adapt hypernetwork to generate personalized focal modulation layers per client, outperforming traditional methods in non-IID and cross-domain settings. In this extended version, we propose AdaptFED, where we deepen the investigation of focal modulation in generalizable FL by incorporating: (1) a refined adaptation strategy that integrates task-aware client embeddings to personalize modulation dynamics further, (2) enhanced theoretical bounds on adaptation performance, and (3) broader empirical validation across additional modalities, including time-series and multilingual data. We also introduce an efficient variant of TransFed that reduces server-client communication overhead via low-rank hypernetwork conditioning, enabling scalable deployment in resource-constrained environments. Extensive experiments on eight diverse datasets reaffirm the superiority of our method over state-of-the-art baselines, particularly in source-free and cross-task federated setups. Our findings not only extend the capabilities of focal modulation in FL but also pave the way for more adaptive, scalable, and generalizable transformer-based federated systems. The code is available at http://github.com/Tajamul21/TransFed

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes