CLFeb 28, 2023

Is Japanese CCGBank empirically correct? A case study of passive and causative constructions

arXiv:2302.14708v128.8288 citationsh-index: 13

Originality Synthesis-oriented

AI Analysis

This work addresses a data quality issue for researchers developing Japanese CCG parsers, but it is incremental as it focuses on specific constructions.

The paper tackled the problem of verifying the linguistic validity of the Japanese CCGBank, showing that it yields empirically wrong predictions for nested passive and causative constructions when used with the ccg2lambda semantic parsing system.

The Japanese CCGBank serves as training and evaluation data for developing Japanese CCG parsers. However, since it is automatically generated from the Kyoto Corpus, a dependency treebank, its linguistic validity still needs to be sufficiently verified. In this paper, we focus on the analysis of passive/causative constructions in the Japanese CCGBank and show that, together with the compositional semantics of ccg2lambda, a semantic parsing system, it yields empirically wrong predictions for the nested construction of passives and causatives.

View on arXiv PDF

Similar