LGMLOct 25, 2019

Using Pairwise Occurrence Information to Improve Knowledge Graph Completion on Large-Scale Datasets

arXiv:1910.11583v1996 citations
Originality Incremental advance
AI Analysis

This work addresses memory constraints in training knowledge graph completion models for large-scale datasets, representing an incremental improvement over existing methods.

The paper tackled the performance bottleneck of bilinear models like DistMult and ComplEx in knowledge graph completion on large-scale datasets by using pairwise occurrence information to construct a joint learning model and improve negative sampling. The result was a significant performance improvement, with a 2.8% absolute gain in hits@1 on a dataset of 2 million entities.

Bilinear models such as DistMult and ComplEx are effective methods for knowledge graph (KG) completion. However, they require large batch sizes, which becomes a performance bottleneck when training on large scale datasets due to memory constraints. In this paper we use occurrences of entity-relation pairs in the dataset to construct a joint learning model and to increase the quality of sampled negatives during training. We show on three standard datasets that when these two techniques are combined, they give a significant improvement in performance, especially when the batch size and the number of generated negative examples are low relative to the size of the dataset. We then apply our techniques to a dataset containing 2 million entities and demonstrate that our model outperforms the baseline by 2.8% absolute on hits@1.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes