CLSep 25, 2024

Pruning Multilingual Large Language Models for Multilingual Inference

arXiv:2409.16911v225 citationsh-index: 6
Originality Incremental advance
AI Analysis

This work addresses the problem of improving multilingual inference for non-English languages, representing an incremental advancement by leveraging existing translation capabilities.

This paper tackles the performance disparity between English and non-English languages in multilingual large language models (MLLMs) by pruning weights not associated with large magnitude features critical for translation, resulting in enhanced zero-shot performance in non-English languages.

Multilingual large language models (MLLMs), trained on multilingual balanced data, demonstrate better zero-shot learning performance in non-English languages compared to large language models trained on English-dominant data. However, the disparity in performance between English and non-English languages remains a challenge yet to be fully addressed. A distinctive characteristic of MLLMs is their high-quality translation capabilities, indicating an acquired proficiency in aligning between languages. This study explores how to enhance the zero-shot performance of MLLMs in non-English languages by leveraging their alignment capability between English and non-English languages. To achieve this, we first analyze the behavior of MLLMs when performing translation and reveal that there are large magnitude features that play a critical role in the translation process. Inspired by these findings, we retain the weights associated with operations involving the large magnitude features and prune other weights to force MLLMs to rely on these features for tasks beyond translation. We empirically demonstrate that this pruning strategy can enhance the MLLMs' performance in non-English language.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes