CLJun 26, 2023

Composing Parameter-Efficient Modules with Arithmetic Operations

Jinghan Zhang, Shiqi Chen, Junteng Liu, Junxian He

arXiv:2306.14870v221.4171 citationsh-index: 10Has Code

Originality Incremental advance

AI Analysis

This work addresses the need for flexible and efficient adaptation of pretrained language models without additional training, though it is incremental as it builds on existing PEFT methods.

The paper tackles the problem of integrating diverse skills from parameter-efficient finetuning (PEFT) modules by proposing linear arithmetic operations in weight space, resulting in new modules that significantly outperform existing ones across tasks like distribution generalization and domain transfer.

As an efficient alternative to conventional full finetuning, parameter-efficient finetuning (PEFT) is becoming the prevailing method to adapt pretrained language models. In PEFT, a lightweight module is learned on each dataset while the underlying pretrained language model remains unchanged, resulting in multiple compact modules representing diverse skills when applied to various domains and tasks. In this paper, we propose to compose these parameter-efficient modules through linear arithmetic operations in the weight space, thereby integrating different module capabilities. Specifically, we first define addition and negation operators for the module, and then further compose these two basic operators to perform flexible arithmetic. Our approach requires \emph{no additional training} and enables highly flexible module composition. We apply different arithmetic operations to compose the parameter-efficient modules for (1) distribution generalization, (2) multi-tasking, (3) unlearning, and (4) domain transfer. Additionally, we extend our approach to detoxify Alpaca-LoRA, the latest instruction-tuned large language model based on LLaMA. Empirical results demonstrate that our approach produces new and effective parameter-efficient modules that significantly outperform existing ones across all settings.

View on arXiv PDF Code

Similar