LGJan 26, 2025

Decentralized Low-Rank Fine-Tuning of Large Language Models

Sajjad Ghiasvand, Mahnoosh Alizadeh, Ramtin Pedarsani

arXiv:2501.15361v519.715 citationsh-index: 32Proceedings of the 1st Workshop for Research on Agent Language Models (REALM 2025)

Originality Incremental advance

AI Analysis

This work addresses the challenge of scalable and efficient LLM fine-tuning for real-world distributed applications, though it is incremental as it adapts existing methods to a decentralized context.

The paper tackles the problem of fine-tuning large language models in decentralized, privacy-sensitive settings by proposing Dec-LoRA, a decentralized algorithm based on LoRA, which achieves performance comparable to centralized methods under conditions like data heterogeneity and quantization constraints.

While parameter-efficient fine-tuning (PEFT) techniques like Low-Rank Adaptation (LoRA) offer computationally efficient adaptations of Large Language Models (LLMs), their practical deployment often assumes centralized data and training environments. However, real-world scenarios frequently involve distributed, privacy-sensitive datasets that require decentralized solutions. Federated learning (FL) addresses data privacy by coordinating model updates across clients, but it is typically based on centralized aggregation through a parameter server, which can introduce bottlenecks and communication constraints. Decentralized learning, in contrast, eliminates this dependency by enabling direct collaboration between clients, improving scalability and efficiency in distributed environments. Despite its advantages, decentralized LLM fine-tuning remains underexplored. In this work, we propose Dec-LoRA, a decentralized fine-tuning algorithm for LLMs based on LoRA. Through extensive experiments on BERT and LLaMA-2 models, we demonstrate that Dec-LoRA achieves performance comparable to centralized LoRA under various conditions, including data heterogeneity and quantization constraints. Additionally, we provide a rigorous theoretical guarantee proving the convergence of our algorithm to a stationary point for non-convex and smooth loss functions. These findings highlight the potential of Dec-LoRA for scalable LLM fine-tuning in decentralized environments.

View on arXiv PDF

Similar