LGAICLMar 21, 2024

A Unified Framework for Model Editing

arXiv:2403.14236v573 citationsh-index: 25EMNLP
Originality Synthesis-oriented
AI Analysis

This work provides a unified framework for model editing, which is incremental as it builds on existing algorithms to clarify their equivalence and extend capabilities.

The paper unifies ROME and MEMIT model editing algorithms under a single preservation-memorization objective and introduces EMMET, a new algorithm that enables batched editing up to 10,000 edits with performance similar to MEMIT.

ROME and MEMIT are largely believed to be two different model editing algorithms, with the major difference between them being the ability to perform batched edits. In this paper, we unify these two algorithms under a single conceptual umbrella, optimizing for the same goal, which we call the preservation-memorization objective. ROME uses an equality constraint to optimize this objective to perform one edit at a time, whereas MEMIT employs a more flexible least-square constraint that allows for batched edits. We generalize ROME and enable batched editing with equality constraint in the form of EMMET - an Equality-constrained Mass Model Editing algorithm for Transformers, a new batched memory-editing algorithm. EMMET can perform batched-edits up to a batch-size of 10,000, with very similar performance to MEMIT across multiple dimensions. With the introduction of EMMET, we truly unify ROME and MEMIT and show that both algorithms are equivalent in terms of their optimization objective, their abilities (singular and batched editing), their model editing performance and their limitations.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes