IR · AI · Nov 23, 2024

Class Order Disorder in Wikidata and First Fixes

arXiv:2411.15550v1 · 1 citation · h-index: 57
Originality: Synthesis-oriented
AI Analysis

This paper addresses data quality issues in Wikidata, a widely used knowledge base, but it is incremental: it focuses on existing problems without introducing new methods.

The study used SPARQL queries to identify violations of class order in the Wikidata ontology and to measure how prevalent they are. It also demonstrated improvements by manually fixing some of the problems, though specific numbers were not provided.

Wikidata has a large ontology with classes at several orders. The Wikidata ontology has long been known to have violations of class order and information related to class order that appears suspect. SPARQL queries were evaluated against Wikidata to determine the prevalence of several kinds of violations and suspect information, and the results were analyzed. Some changes were manually made to Wikidata to remove some of these results and the queries were rerun, showing the effect of the changes. Suggestions are provided on how the problems uncovered might be addressed, either through better tooling or involvement of the Wikidata community.
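
To make the idea of a class-order check concrete, here is a minimal sketch of one suspect pattern: an item stated to be both an instance of and a subclass of the same class. This is an illustration only and is not taken from the paper; its actual queries may differ. P31 ("instance of") and P279 ("subclass of") are real Wikidata property IDs, but the item names, the function name, and the toy data are hypothetical. The SPARQL string is an assumed equivalent for a Wikidata SPARQL endpoint and is not executed here.

```python
# P31 = "instance of", P279 = "subclass of" (real Wikidata property IDs).
# The item identifiers below are made-up placeholders, not real Wikidata items.
TOY_TRIPLES = [
    ("Q_dog", "P279", "Q_animal"),
    ("Q_fido", "P31", "Q_dog"),
    ("Q_poodle", "P279", "Q_dog"),
    ("Q_poodle", "P31", "Q_dog"),  # suspect: also a subclass of Q_dog
]


def instance_and_subclass_of_same(triples):
    """Return sorted (item, class) pairs where item is both an instance of
    and a subclass of the same class (direct statements only)."""
    instance_of = {(s, o) for s, p, o in triples if p == "P31"}
    subclass_of = {(s, o) for s, p, o in triples if p == "P279"}
    return sorted(instance_of & subclass_of)


# Assumed SPARQL equivalent of the check above; shown for reference only.
SPARQL_EQUIVALENT = """
SELECT ?item ?class WHERE {
  ?item wdt:P31 ?class .
  ?item wdt:P279 ?class .
}
LIMIT 100
"""

if __name__ == "__main__":
    print(instance_and_subclass_of_same(TOY_TRIPLES))
```

Counting the rows such a check returns before and after a manual fix corresponds to the "rerun the queries" step described in the abstract.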
