LGFeb 24, 2024

PRoLoRA: Partial Rotation Empowers More Parameter-Efficient LoRA

Oxford
arXiv:2402.16902v234 citationsh-index: 19ACL
AI Analysis

This work addresses the resource constraints in deploying adapted LLMs, offering a more efficient alternative to LoRA for practitioners, though it is incremental as it builds upon existing parameter-sharing methods.

The paper tackles the problem of high costs in serving multiple low-rank adaptations (LoRAs) for large language models by introducing PRoLoRA, a parameter-efficient finetuning method that uses partial rotation and intra-layer sharing, achieving better performance than LoRA with half the trainable parameters on instruction tuning datasets.

With the rapid scaling of large language models (LLMs), serving numerous low-rank adaptations (LoRAs) concurrently has become increasingly impractical, leading to unaffordable costs and necessitating more parameter-efficient finetuning methods. In this work, we introduce Partially Rotation-enhanced Low-Rank Adaptation (PRoLoRA), an intra-layer sharing mechanism comprising four essential components: broadcast reduction, rotation enhancement, partially-sharing refinement, and rectified initialization strategy. As a superset of LoRA, PRoLoRA retains its advantages, and effectively circumvent the drawbacks of peer parameter-sharing methods with superior model capacity, practical feasibility, and broad applicability. Empirical experiments demonstrate the remarkably higher parameter efficiency of PRoLoRA in both specific parameter budget and performance target scenarios, and its scalability to larger LLMs. Notably, with one time less trainable parameters, PRoLoRA still outperforms LoRA on multiple instruction tuning datasets. Subsequently, an ablation study is conducted to validate the necessity of individual components and highlight the superiority of PRoLoRA over three potential variants. Hopefully, the conspicuously higher parameter efficiency can establish PRoLoRA as a resource-friendly alternative to LoRA.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes