CLFeb 2

Language Steering for Multilingual In-Context Learning

arXiv:2602.02326v10.6h-index: 4

Originality Incremental advance

AI Analysis

This addresses the problem of inferior multilingual performance in LLMs for users relying on in-context learning, though it is incremental as it builds on existing activation steering techniques.

The paper tackled the performance disparity of multilingual large language models in non-English in-context learning by proposing language vectors, a training-free method that steers model activations towards target languages, resulting in consistent improvements across 19 languages and three models.

While multilingual large language models have gained widespread adoption, their performance on non-English languages remains substantially inferior to English. This disparity is particularly evident in in-context learning scenarios, where providing demonstrations in English but testing on non-English inputs leads to significant performance degradation. In this paper, we hypothesize that LLMs develop a universal semantic space for understanding languages, where different languages are encoded as distinct directions within this space. Based on this hypothesis, we propose language vectors -- a training-free language steering approach that leverages activation differences between source and target languages to guide model behavior. We steer the model generations by adding the vector to the intermediate model activations during inference. This is done to make the model's internal representations shift towards the target language space without any parameter updates. We evaluate our method across three datasets and test on a total of 19 languages on three different models. Our results show consistent improvements on multilingual in-context learning over baselines across all tasks and languages tested. Beyond performance gains, hierarchical clustering of steering vectors reveals meaningful linguistic structure aligned with language families. These vectors also successfully transfer across tasks, demonstrating that these representations are task-agnostic.

View on arXiv PDF

Similar