LGAICLFeb 16, 2024

Model Editing by Standard Fine-Tuning

arXiv:2402.11078v336 citationsh-index: 4ACL
Originality Incremental advance
AI Analysis

This provides a simple, architecture-agnostic solution for model editing, though it is incremental as it builds on existing fine-tuning techniques.

The authors tackled the problem of model editing by showing that standard fine-tuning, with minor modifications, can achieve competitive performance, matching or outperforming specialized methods on ZsRE and CounterFact datasets in terms of edit score.

Standard fine-tuning is considered not as effective as specialized methods for model editing due to its comparatively poor performance. However, it is simple, agnostic to the architectural details of the model being edited, and able to leverage advances in standard training techniques with no additional work (e.g., black-box PEFT for computational efficiency), making it an appealing choice for a model editor. In this work, we show that standard fine-tuning alone can yield competitive model editing performance with two minor modifications. First, we optimize the conditional likelihood rather than the full likelihood. Second, in addition to the typical practice of training on randomly paraphrased edit prompts to encourage generalization, we also train on random or similar unedited facts to encourage locality. Our experiments on the ZsRE and CounterFact datasets demonstrate that these simple modifications allow standard fine-tuning to match or outperform highly specialized editors in terms of edit score.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes