CLApr 8

Enhancing Multilingual RAG Systems with Debiased Language Preference-Guided Query Fusion

arXiv:2601.0295622.01 citationsh-index: 5
Predicted impact top 72% in CL · last 90 daysOriginality Incremental advance
AI Analysis

This work addresses biases in multilingual AI systems, offering a more accurate assessment and improved performance for cross-lingual applications, though it is incremental as it builds on existing mRAG methods.

The study tackled the problem of perceived English preference in multilingual RAG systems by identifying structural biases in evaluation benchmarks, and proposed a debiased metric and framework that outperformed baselines across diverse languages.

Multilingual Retrieval-Augmented Generation (mRAG) systems often exhibit a perceived preference for high-resource languages, particularly English, resulting in the widespread adoption of English pivoting. While prior studies attribute this advantage to the superior English-centric capabilities of Large Language Models (LLMs), we find that such measurements are significantly distorted by structural priors inherent in evaluation benchmarks. Specifically, we identify exposure bias and a gold availability prior-both driven by the disproportionate concentration of resources in English-as well as cultural priors rooted in topic locality, as factors that hinder accurate assessment of genuine language preference. To address these biases, we propose DeLP (Debiased Language Preference), a calibrated metric designed to explicitly factor out these structural confounds. Our analysis using DeLP reveals that the previously reported English preference is largely a byproduct of evidence distribution rather than an inherent model bias. Instead, we find that retrievers fundamentally favor monolingual alignment between the query and the document language. Building on this insight, we introduce DELTA (DEbiased Language preference-guided Text Augmentation), a lightweight and efficient mRAG framework that strategically leverages monolingual alignment to optimize cross-lingual retrieval and generation. Experimental results demonstrate that DELTA consistently outperforms English pivoting and mRAG baselines across diverse languages.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes