Class Order Disorder in Wikidata and First Fixes
This addresses data quality issues in Wikidata, a widely used knowledge base, but is incremental as it focuses on existing problems without introducing new methods.
The study identified and quantified violations of class order in the Wikidata ontology, revealing prevalent issues through SPARQL queries, and demonstrated improvements by manually fixing some problems, though specific numbers were not provided.
Wikidata has a large ontology with classes at several orders. The Wikidata ontology has long been known to have violations of class order and information related to class order that appears suspect. SPARQL queries were evaluated against Wikidata to determine the prevalence of several kinds of violations and suspect information and the results analyzed. Some changes were manually made to Wikidata to remove some of these results and the queries rerun, showing the effect of the changes. Suggestions are provided on how the problems uncovered might be addressed, either though better tooling or involvement of the Wikidata community.