CLDec 3, 2025

Dual LoRA: Enhancing LoRA with Magnitude and Direction Updates

arXiv:2512.03402v1h-index: 10
Originality Incremental advance
AI Analysis

This work addresses the problem of inefficient fine-tuning for large language models, offering an incremental improvement over existing parameter-efficient methods.

The paper tackles the performance limitations of Low-rank adaptation (LoRA) in fine-tuning large language models by proposing Dual LoRA, which separates updates into magnitude and direction groups to better simulate full fine-tuning, resulting in consistent outperformance over LoRA and its variants across various NLP tasks.

Low-rank adaptation (LoRA) is one of the most popular methods among parameter-efficient fine-tuning (PEFT) methods to adapt pre-trained large language models (LLMs) to specific downstream tasks. However, the model trained based on LoRA often has an unsatisfactory performance due to its low-rank assumption. In this paper, we propose a novel method called Dual LoRA to improve the performance by incorporating an inductive bias into the original LoRA. Specifically, we separate low-rank matrices into two groups: the magnitude group to control whether or not and how far we should update a parameter and the direction group to decide whether this parameter should move forward or backward, to better simulate the parameter updating process of the full fine-tuning based on gradient-based optimization algorithms. We show that this can be simply achieved by adding a ReLU function to the magnitude group and a sign function to the direction group. We conduct several experiments over a wide range of NLP tasks, including natural language generation (NLG), understanding (NLU), and commonsense reasoning datasets on GPT-2, RoBERTa, DeBERTa, and LLaMA-1/2/3 as baseline models. The results show that we consistently outperform LoRA and its state-of-the-art variants with the same number of trainable parameters.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes