AICLLGMLJul 21, 2019

Quantifying Similarity between Relations with Fact Distribution

arXiv:1907.08937v11093 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses the challenge of relation similarity for knowledge base applications, offering incremental improvements in tasks like Open IE and relational classification.

The paper tackles the problem of quantifying similarity between relations in knowledge bases by introducing a method based on the divergence of conditional probability distributions over entity pairs, showing that its outputs significantly correlate with human judgments and can detect redundant relations and improve relational classification tasks.

We introduce a conceptually simple and effective method to quantify the similarity between relations in knowledge bases. Specifically, our approach is based on the divergence between the conditional probability distributions over entity pairs. In this paper, these distributions are parameterized by a very simple neural network. Although computing the exact similarity is in-tractable, we provide a sampling-based method to get a good approximation. We empirically show the outputs of our approach significantly correlate with human judgments. By applying our method to various tasks, we also find that (1) our approach could effectively detect redundant relations extracted by open information extraction (Open IE) models, that (2) even the most competitive models for relational classification still make mistakes among very similar relations, and that (3) our approach could be incorporated into negative sampling and softmax classification to alleviate these mistakes. The source code and experiment details of this paper can be obtained from https://github.com/thunlp/relation-similarity.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes