CY, AI · Aug 14, 2020

Challenges of Linking Organizational Information in Open Government Data to Knowledge Graphs

arXiv:2008.06232v1 · 1 citation
AI Analysis

The paper addresses a need of researchers and practitioners who require fine-grained analyses of OGD, but its contribution is incremental: it highlights existing problems without proposing a novel solution.

The paper tackles the problem of linking publishing organizations in Open Government Data (OGD) portals to knowledge graphs such as Wikidata and DBpedia. It identifies significant challenges, including ambiguous identifiers, temporal changes, and data quality issues, and it offers an analysis of these open challenges along with suggestions for addressing them.

Open Government Data (OGD) is being published by various public administration organizations around the globe. Within the metadata of OGD data catalogs, the publishing organizations (1) are not uniquely and unambiguously identifiable and, even worse, (2) change over time as public administration units are merged or restructured. In order to enable fine-grained analyses or searches on Open Government Data at the level of publishing organizations, linking those organizations from OGD portals to publicly available knowledge graphs (KGs) such as Wikidata and DBpedia seems like an obvious solution. Still, as we show in this position paper, organization linking faces significant challenges, both in the available portal metadata and in the KGs, in terms of data quality and completeness. We herein specifically highlight five main challenges, namely regarding (1) temporal changes in organizations and in the portal metadata, (2) lack of a base ontology for describing organizational structures and changes in public knowledge graphs, (3) metadata and KG data quality, (4) multilinguality, and (5) disambiguating public sector organizations. Based on available OGD portal metadata from the Open Data Portal Watch, we provide an in-depth analysis of these issues, suggest concrete starting points for tackling them, and call on the community to jointly work on these open challenges.
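To make challenges (1), (4), and (5) more concrete, the following is a minimal sketch of how a publisher name from portal metadata might be matched against knowledge-graph candidates while taking the publication date into account. It is an illustration under stated assumptions, not the method of the paper: the `KGOrganization` record, the `ORG-A`/`ORG-B` identifiers, the sample ministries, and the helper names are all hypothetical, and the in-memory list stands in for a real KG lookup.

```python
from __future__ import annotations

import unicodedata
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class KGOrganization:
    """Hypothetical, simplified stand-in for a knowledge-graph entity."""
    entity_id: str               # made-up identifier, not a real Wikidata QID
    labels: tuple[str, ...]      # labels in any language
    inception: Optional[date]    # None means unknown
    dissolved: Optional[date]    # None means still active or unknown


def normalize(label: str) -> str:
    """Case-fold, strip accents, and collapse whitespace in a label."""
    decomposed = unicodedata.normalize("NFKD", label)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return " ".join(no_accents.casefold().split())


def existed_on(org: KGOrganization, day: date) -> bool:
    """True if the organization plausibly existed on the given day."""
    started = org.inception is None or org.inception <= day
    not_ended = org.dissolved is None or day < org.dissolved
    return started and not_ended


def candidates(publisher: str, published_on: date,
               kg: list[KGOrganization]) -> list[KGOrganization]:
    """Return entities whose label matches and that existed at publication time."""
    key = normalize(publisher)
    return [
        org for org in kg
        if key in {normalize(label) for label in org.labels}
        and existed_on(org, published_on)
    ]


# Invented sample data: a ministry that was restructured in 2014 and whose
# successor kept the same English label (the ambiguity the abstract describes).
SAMPLE_KG = [
    KGOrganization("ORG-A", ("Ministry of Economy",),
                   date(2000, 1, 1), date(2014, 1, 1)),
    KGOrganization("ORG-B", ("Ministry of Economy", "Ministère de l'Économie"),
                   date(2014, 1, 1), None),
]

if __name__ == "__main__":
    for day in (date(2012, 5, 1), date(2016, 5, 1)):
        hits = candidates("ministry of economy", day, SAMPLE_KG)
        print(day, [org.entity_id for org in hits])
```

Without the date filter, both entities would match the same label. This is why exact string matching alone is a weak baseline when portal metadata lacks stable identifiers, and why missing inception or dissolution dates in the KG (challenge 3) directly limit this kind of disambiguation.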

Code Implementations: 1 repo