CLAILGOct 22, 2020

An Industry Evaluation of Embedding-based Entity Alignment

arXiv:2010.11522v2997 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the gap between academic research and practical industrial deployment of entity alignment methods, particularly for medical applications.

The study evaluated state-of-the-art embedding-based entity alignment methods in an industrial context, analyzing their performance with varying seed mapping sizes and biases, and introduced a new medical benchmark from deployed knowledge graphs.

Embedding-based entity alignment has been widely investigated in recent years, but most proposed methods still rely on an ideal supervised learning setting with a large number of unbiased seed mappings for training and validation, which significantly limits their usage. In this study, we evaluate those state-of-the-art methods in an industrial context, where the impact of seed mappings with different sizes and different biases is explored. Besides the popular benchmarks from DBpedia and Wikidata, we contribute and evaluate a new industrial benchmark that is extracted from two heterogeneous knowledge graphs (KGs) under deployment for medical applications. The experimental results enable the analysis of the advantages and disadvantages of these alignment methods and the further discussion of suitable strategies for their industrial deployment.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes