CLAILGSep 22, 2024

Exploring Multilingual Probing in Large Language Models: A Cross-Language Analysis

arXiv:2409.14459v213 citationsh-index: 18Has Code
AI Analysis

This work addresses the issue of language disparities in LLMs for researchers and practitioners, but it is incremental as it applies existing probing methods to new multilingual data.

The paper tackled the problem of limited probing studies on multilingual large language models (LLMs) by extending probing techniques to diverse languages, revealing a consistent performance gap where high-resource languages achieve significantly higher probing accuracy than low-resource ones.

Probing techniques for large language models (LLMs) have primarily focused on English, overlooking the vast majority of the world's languages. In this paper, we extend these probing methods to a multilingual context, investigating the behaviors of LLMs across diverse languages. We conduct experiments on several open-source LLM models, analyzing probing accuracy, trends across layers, and similarities between probing vectors for multiple languages. Our key findings reveal: (1) a consistent performance gap between high-resource and low-resource languages, with high-resource languages achieving significantly higher probing accuracy; (2) divergent layer-wise accuracy trends, where high-resource languages show substantial improvement in deeper layers similar to English; and (3) higher representational similarities among high-resource languages, with low-resource languages demonstrating lower similarities both among themselves and with high-resource languages. These results highlight significant disparities in LLMs' multilingual capabilities and emphasize the need for improved modeling of low-resource languages.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes