AIDLIRLGDec 9, 2021

Wikidated 1.0: An Evolving Knowledge Graph Dataset of Wikidata's Revision History

arXiv:2112.05003v11 citations
Originality Synthesis-oriented
AI Analysis

This addresses a data gap for researchers in the Semantic Web community studying evolving knowledge graphs, though it is incremental as it focuses on dataset creation rather than novel methods.

The paper tackles the lack of large datasets for evolving knowledge graphs by presenting Wikidated 1.0, a dataset of Wikidata's full revision history that encodes changes as RDF triple deletions and additions, and it provides statistical characteristics of this dataset.

Wikidata is the largest general-interest knowledge base that is openly available. It is collaboratively edited by thousands of volunteer editors and has thus evolved considerably since its inception in 2012. In this paper, we present Wikidated 1.0, a dataset of Wikidata's full revision history, which encodes changes between Wikidata revisions as sets of deletions and additions of RDF triples. To the best of our knowledge, it constitutes the first large dataset of an evolving knowledge graph, a recently emerging research subject in the Semantic Web community. We introduce the methodology for generating Wikidated 1.0 from dumps of Wikidata, discuss its implementation and limitations, and present statistical characteristics of the dataset.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes