CLOct 11, 2024

The Impact of Visual Information in Chinese Characters: Evaluating Large Models' Ability to Recognize and Utilize Radicals

arXiv:2410.09013v311 citationsh-index: 4NAACL
Originality Incremental advance
AI Analysis

This addresses the problem of enhancing Chinese language processing for AI systems by leveraging sub-character information, though it is incremental as it builds on existing model capabilities.

The study investigated whether large language and vision-language models can recognize and utilize visual features like radicals in Chinese characters, finding they have limited knowledge but that incorporating radicals into prompts improves part-of-speech tagging performance.

The glyphic writing system of Chinese incorporates information-rich visual features in each character, such as radicals that provide hints about meaning or pronunciation. However, there has been no investigation into whether contemporary Large Language Models (LLMs) and Vision-Language Models (VLMs) can harness these sub-character features in Chinese through prompting. In this study, we establish a benchmark to evaluate LLMs' and VLMs' understanding of visual elements in Chinese characters, including radicals, composition structures, strokes, and stroke counts. Our results reveal that models surprisingly exhibit some, but still limited, knowledge of the visual information, regardless of whether images of characters are provided. To incite models' ability to use radicals, we further experiment with incorporating radicals into the prompts for Chinese language processing (CLP) tasks. We observe consistent improvement in Part-Of-Speech tagging when providing additional information about radicals, suggesting the potential to enhance CLP by integrating sub-character information.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes