CLAISep 16, 2022

ConFiguRe: Exploring Discourse-level Chinese Figures of Speech

Peking U
arXiv:2209.07678v1583 citationsh-index: 42Has Code
Originality Incremental advance
AI Analysis

This work addresses a gap in computational linguistics for Chinese figures of speech, but it is incremental as it builds on existing token- and sentence-level approaches by extending to discourse-level analysis.

The authors tackled the challenge of recognizing discourse-level Chinese figures of speech by introducing the concept of figurative units and creating the ConFiguRe dataset, which includes three tasks (extraction, classification, and recognition) that proved challenging for state-of-the-art models, requiring further research.

Figures of speech, such as metaphor and irony, are ubiquitous in literature works and colloquial conversations. This poses great challenge for natural language understanding since figures of speech usually deviate from their ostensible meanings to express deeper semantic implications. Previous research lays emphasis on the literary aspect of figures and seldom provide a comprehensive exploration from a view of computational linguistics. In this paper, we first propose the concept of figurative unit, which is the carrier of a figure. Then we select 12 types of figures commonly used in Chinese, and build a Chinese corpus for Contextualized Figure Recognition (ConFiguRe). Different from previous token-level or sentence-level counterparts, ConFiguRe aims at extracting a figurative unit from discourse-level context, and classifying the figurative unit into the right figure type. On ConFiguRe, three tasks, i.e., figure extraction, figure type classification and figure recognition, are designed and the state-of-the-art techniques are utilized to implement the benchmarks. We conduct thorough experiments and show that all three tasks are challenging for existing models, thus requiring further research. Our dataset and code are publicly available at https://github.com/pku-tangent/ConFiguRe.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes