CL · Jul 25, 2019

Deep Ranking Based Cost-sensitive Multi-label Learning for Distant Supervision Relation Extraction

arXiv:1907.11521v1 · 2 citations
Originality: Incremental advance
AI Analysis

This work addresses challenges in constructing knowledge bases for information retrieval systems, representing an incremental improvement in relation extraction methods.

The paper tackles the problem of relation overlapping and class imbalance in distant supervision relation extraction by proposing a ranking-based multi-label learning framework with convolutional neural networks and cost-sensitive learning, achieving improved performance on a widely used dataset.

A knowledge base offers a way to improve the intelligence of information retrieval (IR) systems, because it contains numerous relations between entities that can help an IR system infer from one entity to another. Relation extraction is one of the fundamental techniques for constructing a knowledge base. Distant supervision is a semi-supervised learning method for relation extraction that learns from labeled and unlabeled data. However, this approach suffers from the problem of relation overlapping, in which one entity tuple may have multiple relation facts. We believe that relation types can have latent connections, which we call class ties, and that these can be exploited to enhance relation extraction. However, this property of relation classes has not been fully explored before. In this paper, to exploit class ties between relations, we propose a general ranking-based multi-label learning framework combined with convolutional neural networks, in which ranking-based loss functions with a regularization technique are introduced to learn the latent connections between relations. Furthermore, to deal with class imbalance in distant supervision relation extraction, we adopt cost-sensitive learning to rescale the costs from the positive and negative labels. Extensive experiments on a widely used dataset show the effectiveness of our model in exploiting class ties and relieving the class imbalance problem.
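
As a rough illustration of the idea in the abstract, the sketch below scores every relation class for one entity pair and applies a margin-based ranking loss with separate costs for positive and negative labels. It is not the paper's loss function: the hinge form, the function name `cost_sensitive_ranking_loss`, and the parameters `margin`, `pos_cost`, and `neg_cost` are all assumptions made for illustration, and the regularization term mentioned in the abstract is omitted.

```python
def cost_sensitive_ranking_loss(scores, positives, margin=1.0,
                                pos_cost=1.0, neg_cost=1.0):
    """Hypothetical margin-based ranking loss for one entity pair.

    scores    -- list of floats, one model score per relation class
    positives -- set of class indices that hold for the entity pair
                 (several may hold at once: the relation overlapping case)
    margin    -- assumed margin that separates positive from negative scores
    pos_cost, neg_cost -- assumed rescaling factors for the two label groups
    """
    negatives = [c for c in range(len(scores)) if c not in positives]

    # Positive classes are penalized when their score falls below +margin.
    pos_term = sum(max(0.0, margin - scores[c]) for c in positives)

    # Negative classes are penalized when their score rises above -margin.
    neg_term = sum(max(0.0, margin + scores[c]) for c in negatives)

    # Cost-sensitive step: rescale each group's contribution separately, so
    # the many negative labels do not dominate the few positive ones.
    return pos_cost * pos_term + neg_cost * neg_term


# Three relation classes; classes 0 and 2 both hold for this entity pair.
example_loss = cost_sensitive_ranking_loss([2.0, -2.0, 0.5], {0, 2})
```

Lowering `neg_cost` relative to `pos_cost` is one simple way to keep a large number of negative labels from swamping the loss; how the costs are actually set in the paper is not stated in the abstract.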

