LGJan 29

Per-parameter Task Arithmetic for Unlearning in Large Language Models

arXiv:2601.22030v1h-index: 15
Originality Highly original
AI Analysis

This addresses the need for efficient and precise unlearning of private information in large language models, offering a practical solution to mitigate over-forgetting while maintaining model performance.

The paper tackled the problem of over-forgetting in large language model unlearning by proposing a per-parameter task arithmetic mechanism that rescales task vectors based on parameter importance, resulting in improved forgetting effectiveness and model utility compared to standard methods.

In large language model (LLM) unlearning, private information is required to be removed. Task arithmetic unlearns by subtracting a specific task vector (TV)--defined as the parameter difference between a privacy-information-tuned model and the original model. While efficient, it can cause over-forgetting by disrupting parameters essential for retaining other information. Motivated by the observation that each parameter exhibits different importance for forgetting versus retention, we propose a per-parameter task arithmetic (PerTA) mechanism to rescale the TV, allowing per-parameter adjustment. These weights quantify the relative importance of each parameter for forgetting versus retention, estimated via gradients (i.e., PerTA-grad) or the diagonal Fisher information approximation (i.e., PerTA-fisher). Moreover, we discuss the effectiveness of PerTA, extend it to a more general form, and provide further analysis. Extensive experiments demonstrate that PerTA consistently improves upon standard TV, and in many cases surpasses widely used training-based unlearning methods in both forgetting effectiveness and overall model utility. By retaining the efficiency of task arithmetic while mitigating over-forgetting, PerTA offers a principled and practical framework for LLM unlearning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes