CLAISep 30, 2021

Multilingual AMR Parsing with Noisy Knowledge Distillation

arXiv:2109.15196v2663 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of improving multilingual AMR parsing for NLP applications, representing an incremental advance through the application of knowledge distillation in a strict multilingual setting.

The paper tackled multilingual Abstract Meaning Representation (AMR) parsing by using knowledge distillation from an English parser to train a single model for multiple languages, achieving large performance gains of up to 18.8 Smatch points on Chinese and comparable results on English to state-of-the-art parsers.

We study multilingual AMR parsing from the perspective of knowledge distillation, where the aim is to learn and improve a multilingual AMR parser by using an existing English parser as its teacher. We constrain our exploration in a strict multilingual setting: there is but one model to parse all different languages including English. We identify that noisy input and precise output are the key to successful distillation. Together with extensive pre-training, we obtain an AMR parser whose performances surpass all previously published results on four different foreign languages, including German, Spanish, Italian, and Chinese, by large margins (up to 18.8 \textsc{Smatch} points on Chinese and on average 11.3 \textsc{Smatch} points). Our parser also achieves comparable performance on English to the latest state-of-the-art English-only parser.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes