Making Better Use of Bilingual Information for Cross-Lingual AMR Parsing
This work addresses the challenge of improving semantic accuracy in cross-lingual AMR parsing for natural language processing applications, representing an incremental advance.
The paper tackled the problem of less specific concepts in cross-lingual AMR parsing by introducing bilingual input and an auxiliary task, resulting in a 10.6-point improvement in Smatch F1 score over previous state-of-the-art methods.
Abstract Meaning Representation (AMR) is a rooted, labeled, acyclic graph representing the semantics of natural language. As previous works show, although AMR is designed for English at first, it can also represent semantics in other languages. However, they find that concepts in their predicted AMR graphs are less specific. We argue that the misprediction of concepts is due to the high relevance between English tokens and AMR concepts. In this work, we introduce bilingual input, namely the translated texts as well as non-English texts, in order to enable the model to predict more accurate concepts. Besides, we also introduce an auxiliary task, requiring the decoder to predict the English sequences at the same time. The auxiliary task can help the decoder understand what exactly the corresponding English tokens are. Our proposed cross-lingual AMR parser surpasses previous state-of-the-art parser by 10.6 points on Smatch F1 score. The ablation study also demonstrates the efficacy of our proposed modules.