Duplicate Detection as a Service
This provides an accessible tool for users without expert knowledge to enhance knowledge graph completeness, though it is incremental in automating existing methods.
The paper tackles the problem of duplicate detection in knowledge graphs, which is crucial for improving completeness and application performance, by introducing a service-based, no-code solution that is competitive with state-of-the-art methods and has been adopted industrially.
Completeness of a knowledge graph is an important quality dimension and factor on how well an application that makes use of it performs. Completeness can be improved by performing knowledge enrichment. Duplicate detection aims to find identity links between the instances of knowledge graphs and is a fundamental subtask of knowledge enrichment. Current solutions to the problem require expert knowledge of the tool and the knowledge graph they are applied to. Users might not have this expert knowledge. We present our service-based approach to the duplicate detection task that provides an easy-to-use no-code solution that is still competitive with the state-of-the-art and has recently been adopted in an industrial context. The evaluation will be based on several frequently used test scenarios.