LG AIFeb 27, 2025

Probabilistic Federated Prompt-Tuning with Non-IID and Imbalanced Data

Pei-Yau Weng, Minh Hoang, Lam M. Nguyen, My T. Thai, Tsui-Wei Weng, Trong Nghia Hoang

arXiv:2502.19752v117.916 citationsh-index: 24

Originality Incremental advance

AI Analysis

This addresses data heterogeneity issues in federated learning, which is an incremental improvement for distributed machine learning applications.

The paper tackles the problem of fine-tuning pre-trained models in federated learning with non-IID and imbalanced data by integrating federated learning with prompt-tuning, and introduces a probabilistic prompt aggregation method that substantially outperforms existing baselines on computer vision datasets.

Fine-tuning pre-trained models is a popular approach in machine learning for solving complex tasks with moderate data. However, fine-tuning the entire pre-trained model is ineffective in federated data scenarios where local data distributions are diversely skewed. To address this, we explore integrating federated learning with a more effective prompt-tuning method, optimizing for a small set of input prefixes to reprogram the pre-trained model's behavior. Our approach transforms federated learning into a distributed set modeling task, aggregating diverse sets of prompts to globally fine-tune the pre-trained model. We benchmark various baselines based on direct adaptations of existing federated model aggregation techniques and introduce a new probabilistic prompt aggregation method that substantially outperforms these baselines. Our reported results on a variety of computer vision datasets confirm that the proposed method is most effective to combat extreme data heterogeneity in federated learning.

View on arXiv PDF

Similar