CLAIJan 19, 2024

Cross-lingual Editing in Multilingual Language Models

arXiv:2401.10521v2112 citationsHas CodeFindings
Originality Incremental advance
AI Analysis

This addresses the challenge of efficiently updating multilingual models for NLP practitioners, but it is incremental as it builds on existing model editing techniques.

The paper tackled the problem of updating multilingual language models efficiently by introducing cross-lingual model editing (XME), where editing a fact in one language propagates to others, and found that state-of-the-art methods perform poorly, especially across different script families like Latin and Indic.

The training of large language models (LLMs) necessitates substantial data and computational resources, and updating outdated LLMs entails significant efforts and resources. While numerous model editing techniques (METs) have emerged to efficiently update model outputs without retraining, their effectiveness in multilingual LLMs, where knowledge is stored in diverse languages, remains an underexplored research area. This research paper introduces the cross-lingual model editing (\textbf{XME}) paradigm, wherein a fact is edited in one language, and the subsequent update propagation is observed across other languages. To investigate the XME paradigm, we conducted experiments using BLOOM, mBERT, and XLM-RoBERTa using the two writing scripts: \textit{Latin} (English, French, and Spanish) and \textit{Indic} (Hindi, Gujarati, and Bengali). The results reveal notable performance limitations of state-of-the-art METs under the XME setting, mainly when the languages involved belong to two distinct script families. These findings highlight the need for further research and development of XME techniques to address these challenges. For more comprehensive information, the dataset used in this research and the associated code are publicly available at the following URL\url{https://github.com/lingo-iitgn/XME}.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes