DBAIJul 24, 2019

The sameAs Problem: A Survey on Identity Management in the Web of Data

arXiv:1907.10528v111 citations
Originality Synthesis-oriented
AI Analysis

This addresses identity management issues for decentralized knowledge representation systems, but it is a survey paper, so it is incremental in summarizing existing work.

The paper surveys the 'sameAs problem' in the Web of Data, where incorrect use of identity statements like owl:sameAs can cause widespread issues, and it analyzes existing solutions and open challenges.

In a decentralised knowledge representation system such as the Web of Data, it is common and indeed desirable for different knowledge graphs to overlap. Whenever multiple names are used to denote the same thing, owl:sameAs statements are needed in order to link the data and foster reuse. Whilst the deductive value of such identity statements can be extremely useful in enhancing various knowledge-based systems, incorrect use of identity can have wide-ranging effects in a global knowledge space like the Web of Data. With several works already proven that identity in the Web is broken, this survey investigates the current state of this "sameAs problem". An open discussion highlights the main weaknesses suffered by solutions in the literature, and draws open challenges to be faced in the future.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes