CLAug 16, 2019

BERT-Based Multi-Head Selection for Joint Entity-Relation Extraction

arXiv:1908.05908v20.0034 citations
AI Analysis55

This work addresses information extraction for natural language processing applications, representing an incremental improvement over existing methods.

The paper tackles joint entity-relation extraction by incorporating BERT into a multi-head selection framework, achieving an F1-score of 0.876 with a single model and up to 0.8924 with ensembling in a 2019 challenge.

In this paper, we report our method for the Information Extraction task in 2019 Language and Intelligence Challenge. We incorporate BERT into the multi-head selection framework for joint entity-relation extraction. This model extends existing approaches from three perspectives. First, BERT is adopted as a feature extraction layer at the bottom of the multi-head selection framework. We further optimize BERT by introducing a semantic-enhanced task during BERT pre-training. Second, we introduce a large-scale Baidu Baike corpus for entity recognition pre-training, which is of weekly supervised learning since there is no actual named entity label. Third, soft label embedding is proposed to effectively transmit information between entity recognition and relation extraction. Combining these three contributions, we enhance the information extracting ability of the multi-head selection model and achieve F1-score 0.876 on testset-1 with a single model. By ensembling four variants of our model, we finally achieve F1 score 0.892 (1st place) on testset-1 and F1 score 0.8924 (2nd place) on testset-2.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes