IRDLHCApr 12, 2012

Collaboratively Patching Linked Data

arXiv:1204.2715v124 citations
Originality Incremental advance
AI Analysis

This addresses data quality issues in the Web of Data for publishers and consumers, but it is incremental as it builds on existing curation efforts.

The paper tackles the problem of noisy and erroneous Linked Data by proposing a collaborative approach to expose and reuse patches, demonstrated through a game that patches DBpedia statements and provides change notifications.

Today's Web of Data is noisy. Linked Data often needs extensive preprocessing to enable efficient use of heterogeneous resources. While consistent and valid data provides the key to efficient data processing and aggregation we are facing two main challenges: (1st) Identification of erroneous facts and tracking their origins in dynamically connected datasets is a difficult task, and (2nd) efforts in the curation of deficient facts in Linked Data are exchanged rather rarely. Since erroneous data often is duplicated and (re-)distributed by mashup applications it is not only the responsibility of a few original publishers to keep their data tidy, but progresses to be a mission for all distributers and consumers of Linked Data too. We present a new approach to expose and to reuse patches on erroneous data to enhance and to add quality information to the Web of Data. The feasibility of our approach is demonstrated by example of a collaborative game that patches statements in DBpedia data and provides notifications for relevant changes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes