Uncovering the Fragility of Trustworthy LLMs through Chinese Textual Ambiguity
This work addresses a critical issue for deploying LLMs in real-world applications where linguistic ambiguity is common, highlighting a fundamental limitation that calls for improved uncertainty handling, though it is incremental as it builds on existing trustworthiness research.
The researchers tackled the problem of how large language models (LLMs) handle ambiguous narrative text, particularly in Chinese, by creating a benchmark dataset and finding that LLMs exhibit significant fragility, such as inability to reliably distinguish ambiguous from unambiguous text, overconfidence in single interpretations, and overthinking, with behavior differing substantially from humans.
In this work, we study a critical research problem regarding the trustworthiness of large language models (LLMs): how LLMs behave when encountering ambiguous narrative text, with a particular focus on Chinese textual ambiguity. We created a benchmark dataset by collecting and generating ambiguous sentences with context and their corresponding disambiguated pairs, representing multiple possible interpretations. These annotated examples are systematically categorized into 3 main categories and 9 subcategories. Through experiments, we discovered significant fragility in LLMs when handling ambiguity, revealing behavior that differs substantially from humans. Specifically, LLMs cannot reliably distinguish ambiguous text from unambiguous text, show overconfidence in interpreting ambiguous text as having a single meaning rather than multiple meanings, and exhibit overthinking when attempting to understand the various possible meanings. Our findings highlight a fundamental limitation in current LLMs that has significant implications for their deployment in real-world applications where linguistic ambiguity is common, calling for improved approaches to handle uncertainty in language understanding. The dataset and code are publicly available at this GitHub repository: https://github.com/ictup/LLM-Chinese-Textual-Disambiguation.