CLOct 4, 2025

Prompt Balance Matters: Understanding How Imbalanced Few-Shot Learning Affects Multilingual Sense Disambiguation in LLMs

arXiv:2510.03762v12 citationsh-index: 5
Originality Incremental advance
AI Analysis

This addresses a bias problem in multilingual NLP for researchers and practitioners, though it is incremental as it builds on existing prompting methods.

The study examined how imbalanced few-shot examples affect Word Sense Disambiguation in multilingual LLMs, finding that such imbalances cause incorrect predictions in languages like German, Spanish, French, and Italian but not in English, with evaluations on GPT-4o and LLaMA-3.1-70B models.

Recent advances in Large Language Models (LLMs) have significantly reshaped the landscape of Natural Language Processing (NLP). Among the various prompting techniques, few-shot prompting has gained considerable attention for its practicality and effectiveness. This study investigates how few-shot prompting strategies impact the Word Sense Disambiguation (WSD) task, particularly focusing on the biases introduced by imbalanced sample distributions. We use the GLOSSGPT prompting method, an advanced approach for English WSD, to test its effectiveness across five languages: English, German, Spanish, French, and Italian. Our results show that imbalanced few-shot examples can cause incorrect sense predictions in multilingual languages, but this issue does not appear in English. To assess model behavior, we evaluate both the GPT-4o and LLaMA-3.1-70B models and the results highlight the sensitivity of multilingual WSD to sample distribution in few-shot settings, emphasizing the need for balanced and representative prompting strategies.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes