CLMar 4, 2025

Multilingual Relative Clause Attachment Ambiguity Resolution in Large Language Models

arXiv:2503.02971v12 citationsh-index: 2Has CodePACLIC
Originality Synthesis-oriented
AI Analysis

It addresses variability in LLMs' linguistic ambiguity resolution for multilingual applications, highlighting incremental improvements needed for non-European languages.

This study evaluated how large language models (LLMs) resolve relative clause attachment ambiguities across multiple languages, finding they performed well in Indo-European languages but struggled in Asian languages like Japanese and Korean, often defaulting to incorrect English translations.

This study examines how large language models (LLMs) resolve relative clause (RC) attachment ambiguities and compares their performance to human sentence processing. Focusing on two linguistic factors, namely the length of RCs and the syntactic position of complex determiner phrases (DPs), we assess whether LLMs can achieve human-like interpretations amid the complexities of language. In this study, we evaluated several LLMs, including Claude, Gemini and Llama, in multiple languages: English, Spanish, French, German, Japanese, and Korean. While these models performed well in Indo-European languages (English, Spanish, French, and German), they encountered difficulties in Asian languages (Japanese and Korean), often defaulting to incorrect English translations. The findings underscore the variability in LLMs' handling of linguistic ambiguities and highlight the need for model improvements, particularly for non-European languages. This research informs future enhancements in LLM design to improve accuracy and human-like processing in diverse linguistic environments.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes