CLDec 2, 2024

Revisiting Absence withSymptoms that *T* Show up Decades Later to Recover Empty Categories

arXiv:2412.01109v1h-index: 3
Originality Incremental advance
AI Analysis

It addresses a syntactic parsing challenge for computational linguists, but is incremental as it extends existing methods to new languages.

This paper tackles the problem of restoring null elements in parse trees for English, Chinese, and Korean, achieving F1 scores up to 90.94, 85.38, and 88.79 respectively with a neural model.

This paper explores null elements in English, Chinese, and Korean Penn treebanks. Null elements contain important syntactic and semantic information, yet they have typically been treated as entities to be removed during language processing tasks, particularly in constituency parsing. Thus, we work towards the removal and, in particular, the restoration of null elements in parse trees. We focus on expanding a rule-based approach utilizing linguistic context information to Chinese, as rule based approaches have historically only been applied to English. We also worked to conduct neural experiments with a language agnostic sequence-to-sequence model to recover null elements for English (PTB), Chinese (CTB) and Korean (KTB). To the best of the authors' knowledge, null elements in three different languages have been explored and compared for the first time. In expanding a rule based approach to Chinese, we achieved an overall F1 score of 80.00, which is comparable to past results in the CTB. In our neural experiments we achieved F1 scores up to 90.94, 85.38 and 88.79 for English, Chinese, and Korean respectively with functional labels.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes