CLApr 10, 2021

Representation Learning for Weakly Supervised Relation Extraction

arXiv:2105.00815v30.2

Originality Incremental advance

AI Analysis

This work addresses the challenge of data scarcity in relation extraction, an incremental improvement for information extraction tasks.

The paper tackles the problem of limited labeled data in relation extraction by using unsupervised pre-training to learn distributed text representation features, which when combined with traditional hand-crafted features, improves the performance of a logistic classification model, particularly for relations with few training instances.

Recent years have seen rapid development in Information Extraction, as well as its subtask, Relation Extraction. Relation Extraction is able to detect semantic relations between entities in sentences. Currently, many efficient approaches have been applied to relation extraction tasks. Supervised learning approaches especially have good performance. However, there are still many difficult challenges. One of the most serious problems is that manually labeled data is difficult to acquire. In most cases, limited data for supervised approaches equals lousy performance. Thus here, under the situation with only limited training data, we focus on how to improve the performance of our supervised baseline system with unsupervised pre-training. Feature is one of the key components in improving the supervised approaches. Traditional approaches usually apply hand-crafted features, which require expert knowledge and expensive human labor. However, this type of feature might suffer from data sparsity: when the training set size is small, the model parameters might be poorly estimated. In this thesis, we present several novel unsupervised pre-training models to learn the distributed text representation features, which are encoded with rich syntactic-semantic patterns of relation expressions. The experiments have demonstrated that this type of feature, combine with the traditional hand-crafted features, could improve the performance of the logistic classification model for relation extraction, especially on the classification of relations with only minor training instances.

View on arXiv PDF

Similar