CLSep 27, 2024

"Why" Has the Least Side Effect on Model Editing

Tsung-Hsuan Pan, Chung-Chi Chen, Hen-Hsen Huang, Hsin-Hsi Chen

arXiv:2409.18679v11.91 citationsh-index: 14

Originality Synthesis-oriented

AI Analysis

This work addresses the problem of unintended side effects in model editing for LLMs, providing incremental insights for experimental design.

The paper investigates how question type affects side effects in model editing for large language models, finding that performance degradation varies significantly across question types and that insights from smaller models do not always apply to larger ones.

Training large language models (LLMs) from scratch is an expensive endeavor, particularly as world knowledge continually evolves. To maintain relevance and accuracy of LLMs, model editing has emerged as a pivotal research area. While these methods hold promise, they can also produce unintended side effects. Their underlying factors and causes remain largely unexplored. This paper delves into a critical factor-question type-by categorizing model editing questions. Our findings reveal that the extent of performance degradation varies significantly across different question types, providing new insights for experimental design in knowledge editing. Furthermore, we investigate whether insights from smaller models can be extrapolated to larger models. Our results indicate discrepancies in findings between models of different sizes, suggesting that insights from smaller models may not necessarily apply to larger models. Additionally, we examine the impact of batch size on side effects, discovering that increasing the batch size can mitigate performance drops.

View on arXiv PDF

Similar