LGAIJul 16, 2024

Enhancing Parameter Efficiency and Generalization in Large-Scale Models: A Regularized and Masked Low-Rank Adaptation Approach

arXiv:2407.12074v12 citationsh-index: 9Has Code
Originality Incremental advance
AI Analysis

This work addresses resource challenges in fine-tuning large models for applications like mobile systems, offering an incremental improvement over existing LoRA methods.

The paper tackled the problem of suboptimal performance and overfitting in Low-Rank Adaptation (LoRA) for fine-tuning large pre-trained models, proposing RM-LoRA, which achieved superior generalization with the same or lower trainable parameter budget across vision and language datasets.

Large pre-trained models, such as large language models (LLMs), present significant resource challenges for fine-tuning due to their extensive parameter sizes, especially for applications in mobile systems. To address this, Low-Rank Adaptation (LoRA) has been developed to reduce resource consumption while maintaining satisfactory fine-tuning results. Despite its effectiveness, the original LoRA method faces challenges of suboptimal performance and overfitting. This paper investigates the intrinsic dimension of the matrix updates approximated by the LoRA method and reveals the performance benefits of increasing this intrinsic dimension. By employing regularization and a gradient masking method that encourages higher intrinsic dimension, the proposed method, termed Regularized and Masked LoRA (RM-LoRA), achieves superior generalization performance with the same or lower trainable parameter budget compared to the original LoRA and its latest variants across various open-source vision and language datasets.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes