LGJan 29

Per-parameter Task Arithmetic for Unlearning in Large Language Models

Chengyi Cai, Zesheng Ye, Jiangchao Yao, Jianzhong Qi, Bo Han, Xiaolu Zhang, Feng Liu, Jun Zhou

arXiv:2601.22030v11.4h-index: 15

Originality Highly original

AI Analysis

This addresses the need for efficient and precise unlearning of private information in large language models, offering a practical solution to mitigate over-forgetting while maintaining model performance.

The paper tackled the problem of over-forgetting in large language model unlearning by proposing a per-parameter task arithmetic mechanism that rescales task vectors based on parameter importance, resulting in improved forgetting effectiveness and model utility compared to standard methods.

In large language model (LLM) unlearning, private information is required to be removed. Task arithmetic unlearns by subtracting a specific task vector (TV)--defined as the parameter difference between a privacy-information-tuned model and the original model. While efficient, it can cause over-forgetting by disrupting parameters essential for retaining other information. Motivated by the observation that each parameter exhibits different importance for forgetting versus retention, we propose a per-parameter task arithmetic (PerTA) mechanism to rescale the TV, allowing per-parameter adjustment. These weights quantify the relative importance of each parameter for forgetting versus retention, estimated via gradients (i.e., PerTA-grad) or the diagonal Fisher information approximation (i.e., PerTA-fisher). Moreover, we discuss the effectiveness of PerTA, extend it to a more general form, and provide further analysis. Extensive experiments demonstrate that PerTA consistently improves upon standard TV, and in many cases surpasses widely used training-based unlearning methods in both forgetting effectiveness and overall model utility. By retaining the efficiency of task arithmetic while mitigating over-forgetting, PerTA offers a principled and practical framework for LLM unlearning.

View on arXiv PDF

Similar