CL LGNov 24, 2019

CopyMTL: Copy Mechanism for Joint Extraction of Entities and Relations with Multi-Task Learning

Daojian Zeng, Ranran Haoran Zhang, Qianying Liu

arXiv:1911.10438v26.2205 citationsh-index: 16Has Code

Originality Incremental advance

AI Analysis

This work solves entity-relation extraction problems for natural language processing applications, representing an incremental improvement over existing methods.

The paper tackled the joint extraction of entities and relations by addressing CopyRE's weaknesses in distinguishing head and tail entities and handling multi-token entities, resulting in a 9% F1 score improvement on NYT and 16% on WebNLG datasets.

Joint extraction of entities and relations has received significant attention due to its potential of providing higher performance for both tasks. Among existing methods, CopyRE is effective and novel, which uses a sequence-to-sequence framework and copy mechanism to directly generate the relation triplets. However, it suffers from two fatal problems. The model is extremely weak at differing the head and tail entity, resulting in inaccurate entity extraction. It also cannot predict multi-token entities (e.g. \textit{Steven Jobs}). To address these problems, we give a detailed analysis of the reasons behind the inaccurate entity extraction problem, and then propose a simple but extremely effective model structure to solve this problem. In addition, we propose a multi-task learning framework equipped with copy mechanism, called CopyMTL, to allow the model to predict multi-token entities. Experiments reveal the problems of CopyRE and show that our model achieves significant improvement over the current state-of-the-art method by 9% in NYT and 16% in WebNLG (F1 score). Our code is available at https://github.com/WindChimeRan/CopyMTL

View on arXiv PDF Code

Similar