CLSep 27, 2024

"Why" Has the Least Side Effect on Model Editing

arXiv:2409.18679v11 citationsh-index: 14
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of unintended side effects in model editing for LLMs, providing incremental insights for experimental design.

The paper investigates how question type affects side effects in model editing for large language models, finding that performance degradation varies significantly across question types and that insights from smaller models do not always apply to larger ones.

Training large language models (LLMs) from scratch is an expensive endeavor, particularly as world knowledge continually evolves. To maintain relevance and accuracy of LLMs, model editing has emerged as a pivotal research area. While these methods hold promise, they can also produce unintended side effects. Their underlying factors and causes remain largely unexplored. This paper delves into a critical factor-question type-by categorizing model editing questions. Our findings reveal that the extent of performance degradation varies significantly across different question types, providing new insights for experimental design in knowledge editing. Furthermore, we investigate whether insights from smaller models can be extrapolated to larger models. Our results indicate discrepancies in findings between models of different sizes, suggesting that insights from smaller models may not necessarily apply to larger models. Additionally, we examine the impact of batch size on side effects, discovering that increasing the batch size can mitigate performance drops.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes