CLOct 4, 2025

Prompt Balance Matters: Understanding How Imbalanced Few-Shot Learning Affects Multilingual Sense Disambiguation in LLMs

Deshan Sumanathilaka, Nicholas Micallef, Julian Hough

arXiv:2510.03762v16.72 citationsh-index: 5

Originality Incremental advance

AI Analysis

This addresses a bias problem in multilingual NLP for researchers and practitioners, though it is incremental as it builds on existing prompting methods.

The study examined how imbalanced few-shot examples affect Word Sense Disambiguation in multilingual LLMs, finding that such imbalances cause incorrect predictions in languages like German, Spanish, French, and Italian but not in English, with evaluations on GPT-4o and LLaMA-3.1-70B models.

Recent advances in Large Language Models (LLMs) have significantly reshaped the landscape of Natural Language Processing (NLP). Among the various prompting techniques, few-shot prompting has gained considerable attention for its practicality and effectiveness. This study investigates how few-shot prompting strategies impact the Word Sense Disambiguation (WSD) task, particularly focusing on the biases introduced by imbalanced sample distributions. We use the GLOSSGPT prompting method, an advanced approach for English WSD, to test its effectiveness across five languages: English, German, Spanish, French, and Italian. Our results show that imbalanced few-shot examples can cause incorrect sense predictions in multilingual languages, but this issue does not appear in English. To assess model behavior, we evaluate both the GPT-4o and LLaMA-3.1-70B models and the results highlight the sensitivity of multilingual WSD to sample distribution in few-shot settings, emphasizing the need for balanced and representative prompting strategies.

View on arXiv PDF

Similar