CL AIMar 11

Mitigating Translationese Bias in Multilingual LLM-as-a-Judge via Disentangled Information Bottleneck

Hongbin Zhang, Kehai Chen, Xuefen Bai, Youcheng Pan, Yang Xiang, Jinpeng Wang, Min Zhang

arXiv:2603.10351v117.4h-index: 4

Predicted impact top 51% in CL · last 90 daysOriginality Incremental advance

AI Analysis

This addresses a systematic bias in multilingual evaluation for LLMs, particularly affecting low-resource languages, but is incremental as it builds on existing fine-tuning and information bottleneck methods.

The paper tackled the problem of translationese bias in multilingual LLM-as-a-Judge systems, where models favor machine-translated text over human-authored references, especially in low-resource languages, and proposed DIBJudge, a fine-tuning framework that mitigates this bias and outperforms baselines in evaluations.

Large language models (LLMs) have become a standard for multilingual evaluation, yet they exhibit a severe systematic translationese bias. In this paper, translationese bias is characterized as LLMs systematically favoring machine-translated text over human-authored references, particularly in low-resource languages. We attribute this bias to spurious correlations with (i) latent manifold alignment with English and (ii) cross-lingual predictability. To mitigate this bias, we propose DIBJudge, a robust fine-tuning framework that learns a minimally sufficient, judgment-critical representation via variational information compression, while explicitly isolating spurious factors into the dedicated bias branch. Furthermore, we incorporate a cross-covariance penalty that explicitly suppresses statistical dependence between robust and bias representations, thereby encouraging effective disentanglement. Extensive evaluations on multilingual reward modeling benchmarks and a dedicated translationese bias evaluation suite demonstrate that the proposed DIBJudge consistently outperforms strong baselines and substantially mitigates translationese bias.

View on arXiv PDF

Similar