CL · Jan 30, 2025

CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering

arXiv:2501.18457v2 · 13 citations · h-index: 13 · NAACL
Originality: Incremental advance
AI Analysis

This paper addresses inconsistent LLM answers to the same question asked in different languages. The advance is rated incremental because it builds on existing alignment techniques such as DPO.

The paper tackles performance disparities in large language models (LLMs) when answering culture-independent questions across languages. It proposes CALM, a method that strengthens cross-lingual knowledge alignment and improves accuracy and consistency on datasets such as MEDQA and X-CSQA.

Large Language Models (LLMs) are pretrained on extensive multilingual corpora to acquire both language-specific cultural knowledge and general knowledge. Ideally, while LLMs should provide consistent responses to culture-independent questions across languages, we observe significant performance disparities. To address this, we explore the Cross-Lingual Self-Aligning ability of Language Models (CALM) to align knowledge across languages. Specifically, for a given question, we sample multiple responses across different languages and select the most self-consistent response as the target, leaving the remaining responses as negative examples. We then employ direct preference optimization (DPO) to align the model's knowledge across different languages. Evaluations on the MEDQA and X-CSQA datasets demonstrate CALM's effectiveness in enhancing cross-lingual knowledge question answering, both in zero-shot and retrieval-augmented settings. We also found that increasing the number of languages involved in CALM training leads to higher accuracy and consistency. We offer a qualitative analysis of how cross-lingual consistency can enhance knowledge alignment and explore the method's generalizability.
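The data-construction step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes multiple-choice questions, one sampled answer label per language, and a plain majority vote as the measure of self-consistency. The function name `build_preference_pairs`, the tie-skipping rule, and the `prompt`/`chosen`/`rejected` record layout are assumptions made for this example.

```python
from collections import Counter


def build_preference_pairs(prompts, answers):
    """Build DPO-style preference pairs for one question.

    prompts: dict mapping language code -> question text in that language
    answers: dict mapping language code -> option label sampled from the model
    """
    counts = Counter(answers.values()).most_common()
    if not counts:
        return []
    # Assumption: skip the question when no single answer wins the vote.
    if len(counts) > 1 and counts[0][1] == counts[1][1]:
        return []
    target = counts[0][0]  # most self-consistent answer across languages

    pairs = []
    for lang, answer in answers.items():
        # Every language that disagrees with the target yields one pair:
        # the target is preferred, the language's own answer is rejected.
        if answer != target:
            pairs.append(
                {"prompt": prompts[lang], "chosen": target, "rejected": answer}
            )
    return pairs


# Hypothetical example: English and French agree on "B", Chinese says "C".
prompts = {"en": "Question (en)", "fr": "Question (fr)", "zh": "Question (zh)"}
answers = {"en": "B", "fr": "B", "zh": "C"}
pairs = build_preference_pairs(prompts, answers)
```

Here `pairs` holds a single record for the Chinese prompt, preferring "B" over "C". Records of this shape could then be passed to a preference-optimization trainer; the abstract names DPO for that step but does not specify the training setup.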

