AINov 28, 2017

Classification of entities via their descriptive sentences

arXiv:1711.10317v1
Originality Synthesis-oriented
AI Analysis

This work addresses the problem of low precision or recall in hypernym identification for taxonomy construction, but it is incremental as it builds on existing classification and clustering techniques.

The paper tackled hypernym identification for open-domain entities by using a classification-based method with a pre-defined taxonomy, achieving 99.36% precision on 1.1 million out of 2.1 million Baidu Baike entities.

Hypernym identification of open-domain entities is crucial for taxonomy construction as well as many higher-level applications. Current methods suffer from either low precision or low recall. To decrease the difficulty of this problem, we adopt a classification-based method. We pre-define a concept taxonomy and classify an entity to one of its leaf concept, based on the name and description information of the entity. A convolutional neural network classifier and a K-means clustering module are adopted for classification. We applied this system to 2.1 million Baidu Baike entities, and 1.1 million of them were successfully identified with a precision of 99.36%.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes