CLJun 13, 2024

Sharing Matters: Analysing Neurons Across Languages and Tasks in LLMs

arXiv:2406.09265v336 citations
Originality Incremental advance
AI Analysis

This work addresses a gap in analyzing multilingual LLMs for researchers, though it is incremental as it builds on existing neuron analysis methods.

The study tackled the problem of understanding neuron activation sharing across tasks and languages in multilingual large language models (LLMs), finding that deactivating all-shared neurons significantly decreases performance and that activation patterns vary across tasks, models, and languages.

Large language models (LLMs) have revolutionized the field of natural language processing (NLP), and recent studies have aimed to understand their underlying mechanisms. However, most of this research is conducted within a monolingual setting, primarily focusing on English. Few studies have attempted to explore the internal workings of LLMs in multilingual settings. In this study, we aim to fill this research gap by examining how neuron activation is shared across tasks and languages. We classify neurons into four distinct categories based on their responses to a specific input across different languages: all-shared, partial-shared, specific, and non-activated. Building upon this categorisation, we conduct extensive experiments on three tasks across nine languages using several LLMs and present an in-depth analysis in this work. Our findings reveal that: (i) deactivating the all-shared neurons significantly decreases performance; (ii) the shared neurons play a vital role in generating responses, especially for the all-shared neurons; (iii) neuron activation patterns are highly sensitive and vary across tasks, LLMs, and languages. These findings shed light on the internal workings of multilingual LLMs and pave the way for future research. We release the code to foster research in this area.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes