CLLGNov 24, 2019

CopyMTL: Copy Mechanism for Joint Extraction of Entities and Relations with Multi-Task Learning

arXiv:1911.10438v2205 citationsHas Code
Originality Incremental advance
AI Analysis

This work solves entity-relation extraction problems for natural language processing applications, representing an incremental improvement over existing methods.

The paper tackled the joint extraction of entities and relations by addressing CopyRE's weaknesses in distinguishing head and tail entities and handling multi-token entities, resulting in a 9% F1 score improvement on NYT and 16% on WebNLG datasets.

Joint extraction of entities and relations has received significant attention due to its potential of providing higher performance for both tasks. Among existing methods, CopyRE is effective and novel, which uses a sequence-to-sequence framework and copy mechanism to directly generate the relation triplets. However, it suffers from two fatal problems. The model is extremely weak at differing the head and tail entity, resulting in inaccurate entity extraction. It also cannot predict multi-token entities (e.g. \textit{Steven Jobs}). To address these problems, we give a detailed analysis of the reasons behind the inaccurate entity extraction problem, and then propose a simple but extremely effective model structure to solve this problem. In addition, we propose a multi-task learning framework equipped with copy mechanism, called CopyMTL, to allow the model to predict multi-token entities. Experiments reveal the problems of CopyRE and show that our model achieves significant improvement over the current state-of-the-art method by 9% in NYT and 16% in WebNLG (F1 score). Our code is available at https://github.com/WindChimeRan/CopyMTL

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes