LGAIFeb 21, 2025

R-LoRA: Randomized Multi-Head LoRA for Efficient Multi-Task Learning

arXiv:2502.15455v22 citationsh-index: 7Has CodeEMNLP
Originality Incremental advance
AI Analysis

This addresses the need for efficient fine-tuning of LLMs across multiple domains, though it appears incremental as it builds on LoRA.

The paper tackles the problem of LoRA underperforming in multi-task learning for large language models by proposing R-LoRA with Multi-Head Randomization, which improves performance while reducing GPU memory usage and training time.

Fine-tuning large language models (LLMs) is computationally expensive, and Low-Rank Adaptation (LoRA) provides a cost-effective solution by approximating weight updates through low-rank matrices. In real-world scenarios, LLMs are fine-tuned on data from multiple domains to perform tasks across various fields, embodying multi-task learning (MTL). LoRA often underperforms in such complex scenarios. To enhance LoRA's capability in multi-task learning, we propose R-LoRA, which incorporates Multi-Head Randomization. Multi-Head Randomization diversifies the head matrices through Multi-Head Dropout and Multi-Head Random Initialization, enabling more efficient learning of task-specific features while maintaining shared knowledge representation. Our approach not only improves performance in MTL but also reduces GPU memory usage and training time. Experiments show that R-LoRA's gains stem from increased diversity in the head matrices, demonstrating its effectiveness for multi-task learning. The code is available at https://github.com/jinda-liu/R-LoRA

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes