LG AIApr 9, 2024

Automated Federated Pipeline for Parameter-Efficient Fine-Tuning of Large Language Models

Zihan Fang, Zheng Lin, Zhe Chen, Xianhao Chen, Yue Gao, Yuguang Fang

arXiv:2404.06448v228.967 citationsh-index: 23

Originality Incremental advance

AI Analysis

This work addresses privacy-preserving fine-tuning of LLMs for downstream tasks in resource-constrained edge environments, representing an incremental improvement over existing federated learning methods.

The paper tackles the challenge of fine-tuning large language models (LLMs) on private data using federated learning, which faces high computational and communication costs and resource heterogeneity across edge servers, by proposing FedPipe, an automated federated pipeline that reduces training costs without inference latency and achieves higher accuracy than state-of-the-art benchmarks.

Recently, there has been a surge in the development of advanced intelligent generative content (AIGC), especially large language models (LLMs). However, for many downstream tasks, it is necessary to fine-tune LLMs using private data. While federated learning offers a promising privacy-preserving solution to LLM fine-tuning, the substantial size of an LLM, combined with high computational and communication demands, makes it hard to apply to downstream tasks. More importantly, private edge servers often possess varying computing and network resources in real-world scenarios, introducing additional complexities to LLM fine-tuning. To tackle these problems, we design and implement an automated federated pipeline, named FedPipe, to fine-tune LLMs with minimal training cost but without adding any inference latency. FedPipe firstly identifies the weights to be fine-tuned based on their contributions to the LLM training. It then configures a low-rank adapter for each selected weight to train local low-rank adapters on an edge server, and aggregate local adapters of all edge servers to fine-tune the whole LLM. Finally, it appropriately quantizes the parameters of LLM to reduce memory space according to the requirements of edge servers. Extensive experiments demonstrate that FedPipe expedites the model training and achieves higher accuracy than state-of-the-art benchmarks.

View on arXiv PDF

Similar