LGAIFeb 27, 2025

Probabilistic Federated Prompt-Tuning with Non-IID and Imbalanced Data

arXiv:2502.19752v116 citationsh-index: 24
Originality Incremental advance
AI Analysis

This addresses data heterogeneity issues in federated learning, which is an incremental improvement for distributed machine learning applications.

The paper tackles the problem of fine-tuning pre-trained models in federated learning with non-IID and imbalanced data by integrating federated learning with prompt-tuning, and introduces a probabilistic prompt aggregation method that substantially outperforms existing baselines on computer vision datasets.

Fine-tuning pre-trained models is a popular approach in machine learning for solving complex tasks with moderate data. However, fine-tuning the entire pre-trained model is ineffective in federated data scenarios where local data distributions are diversely skewed. To address this, we explore integrating federated learning with a more effective prompt-tuning method, optimizing for a small set of input prefixes to reprogram the pre-trained model's behavior. Our approach transforms federated learning into a distributed set modeling task, aggregating diverse sets of prompts to globally fine-tune the pre-trained model. We benchmark various baselines based on direct adaptations of existing federated model aggregation techniques and introduce a new probabilistic prompt aggregation method that substantially outperforms these baselines. Our reported results on a variety of computer vision datasets confirm that the proposed method is most effective to combat extreme data heterogeneity in federated learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes