LGAIMar 17, 2025

Federated Continual Instruction Tuning

arXiv:2503.12897v210 citationsh-index: 34Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of efficient and scalable instruction tuning for researchers in AI, though it is incremental as it builds on existing federated and continual learning techniques.

The paper tackles the challenge of federated learning for large multimodal models with continuously arriving tasks, introducing the Federated Continual Instruction Tuning (FCIT) benchmark and a method that improves performance across data heterogeneity and catastrophic forgetting.

A vast amount of instruction tuning data is crucial for the impressive performance of Large Multimodal Models (LMMs), but the associated computational costs and data collection demands during supervised fine-tuning make it impractical for most researchers. Federated learning (FL) has the potential to leverage all distributed data and training resources to reduce the overhead of joint training. However, most existing methods assume a fixed number of tasks, while in real-world scenarios, clients continuously encounter new knowledge and often struggle to retain old tasks due to memory constraints. In this work, we introduce the Federated Continual Instruction Tuning (FCIT) benchmark to model this real-world challenge. Our benchmark includes two realistic scenarios, encompassing four different settings and twelve carefully curated instruction tuning datasets. To address the challenges posed by FCIT, we propose dynamic knowledge organization to effectively integrate updates from different tasks during training and subspace selective activation to allocate task-specific output during inference. Extensive experimental results demonstrate that our proposed method significantly enhances model performance across varying levels of data heterogeneity and catastrophic forgetting. Code and dataset are released at https://github.com/Ghy0501/FCIT.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes