CLNov 12, 2023

Tunable Soft Prompts are Messengers in Federated Learning

arXiv:2311.06805v1136 citationsh-index: 18Has Code
Originality Incremental advance
AI Analysis

This addresses privacy and efficiency challenges in federated learning for organizations using proprietary models, though it is an incremental improvement over existing methods.

The paper tackles the lack of model privacy in federated learning, especially for fine-tuning proprietary large language models, by proposing an approach that uses tunable soft prompts as messengers for information exchange, which protects the global model and reduces communication and computation costs.

Federated learning (FL) enables multiple participants to collaboratively train machine learning models using decentralized data sources, alleviating privacy concerns that arise from directly sharing local data. However, the lack of model privacy protection in FL becomes an unneglectable challenge, especially when people want to federally finetune models based on a proprietary large language model. In this study, we propose a novel FL training approach that accomplishes information exchange among participants via tunable soft prompts. These soft prompts, updated and transmitted between the server and clients, assume the role of the global model parameters and serve as messengers to deliver useful knowledge from the local data and global model. As the global model itself is not required to be shared and the local training is conducted based on an auxiliary model with fewer parameters than the global model, the proposed approach provides protection for the global model while reducing communication and computation costs in FL. Extensive experiments show the effectiveness of the proposed approach compared to several baselines. We have released the source code at \url{https://github.com/alibaba/FederatedScope/tree/fedsp/federatedscope/nlp/fedsp}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes