CLSep 25, 2024

Pruning Multilingual Large Language Models for Multilingual Inference

Hwichan Kim, Jun Suzuki, Tosho Hirasawa, Mamoru Komachi

arXiv:2409.16911v213.525 citationsh-index: 6Has Code

Originality Incremental advance

AI Analysis

This work addresses the problem of improving multilingual inference for non-English languages, representing an incremental advancement by leveraging existing translation capabilities.

This paper tackles the performance disparity between English and non-English languages in multilingual large language models (MLLMs) by pruning weights not associated with large magnitude features critical for translation, resulting in enhanced zero-shot performance in non-English languages.

Multilingual large language models (MLLMs), trained on multilingual balanced data, demonstrate better zero-shot learning performance in non-English languages compared to large language models trained on English-dominant data. However, the disparity in performance between English and non-English languages remains a challenge yet to be fully addressed. A distinctive characteristic of MLLMs is their high-quality translation capabilities, indicating an acquired proficiency in aligning between languages. This study explores how to enhance the zero-shot performance of MLLMs in non-English languages by leveraging their alignment capability between English and non-English languages. To achieve this, we first analyze the behavior of MLLMs when performing translation and reveal that there are large magnitude features that play a critical role in the translation process. Inspired by these findings, we retain the weights associated with operations involving the large magnitude features and prune other weights to force MLLMs to rely on these features for tasks beyond translation. We empirically demonstrate that this pruning strategy can enhance the MLLMs' performance in non-English language.

View on arXiv PDF Code

Similar