CLAIJul 21, 2025

How and Where to Translate? The Impact of Translation Strategies in Cross-lingual LLM Prompting

arXiv:2507.22923v12 citationsh-index: 5
Originality Incremental advance
AI Analysis

This work addresses the challenge of effective multilingual resource sharing for low-resource languages, though it is incremental in nature.

The paper tackled the problem of varying LLM performance across languages in multilingual RAG systems by evaluating prompt translation strategies, finding that an optimized strategy significantly improves knowledge sharing and downstream classification performance.

Despite advances in the multilingual capabilities of Large Language Models (LLMs), their performance varies substantially across different languages and tasks. In multilingual retrieval-augmented generation (RAG)-based systems, knowledge bases (KB) are often shared from high-resource languages (such as English) to low-resource ones, resulting in retrieved information from the KB being in a different language than the rest of the context. In such scenarios, two common practices are pre-translation to create a mono-lingual prompt and cross-lingual prompting for direct inference. However, the impact of these choices remains unclear. In this paper, we systematically evaluate the impact of different prompt translation strategies for classification tasks with RAG-enhanced LLMs in multilingual systems. Experimental results show that an optimized prompting strategy can significantly improve knowledge sharing across languages, therefore improve the performance on the downstream classification task. The findings advocate for a broader utilization of multilingual resource sharing and cross-lingual prompt optimization for non-English languages, especially the low-resource ones.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes