CL AI LGJan 9, 2024

RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation

Mahdi Nikdan, Soroush Tabesh, Elvir Crnčević, Dan Alistarh

arXiv:2401.04679v718.153 citationsh-index: 41Has CodeICML

Originality Incremental advance

AI Analysis

This addresses the need for efficient fine-tuning in resource-constrained settings, offering an incremental improvement over prior hybrid methods.

The paper tackles the problem of parameter-efficient fine-tuning for large language models by introducing RoSA, a method that combines low-rank and sparse components to approximate full fine-tuning, showing it outperforms existing methods like LoRA and recovers full fine-tuning performance on some tasks with concrete gains in accuracy.

We investigate parameter-efficient fine-tuning (PEFT) methods that can provide good accuracy under limited computational and memory budgets in the context of large language models (LLMs). We present a new PEFT method called Robust Adaptation (RoSA) inspired by robust principal component analysis that jointly trains $\textit{low-rank}$ and $\textit{highly-sparse}$ components on top of a set of fixed pretrained weights to efficiently approximate the performance of a full-fine-tuning (FFT) solution. Across a series of challenging generative tasks such as grade-school math and SQL query generation, which require fine-tuning for good performance, we show that RoSA outperforms LoRA, pure sparse fine-tuning, and alternative hybrid methods at the same parameter budget, and can even recover the performance of FFT on some tasks. We provide system support for RoSA to complement the training algorithm, specifically in the form of sparse GPU kernels which enable memory- and computationally-efficient training, and show that it is also compatible with low-precision base weights, resulting in the first joint representation combining quantization, low-rank and sparse approximations. Our code is available at https://github.com/IST-DASLab/RoSA.

View on arXiv PDF Code

Similar