DB | AI | Feb 3, 2025

Common Foundations for SHACL, ShEx, and PG-Schema

arXiv:2502.01295v1 | 11 citations | h-index: 25 | WWW
Originality: Synthesis-oriented
AI Analysis

This work addresses confusion among users in the Semantic Web and database communities by synthesizing and comparing existing graph schema languages, with the aim of improving data quality and interoperability in graph-based applications.

The paper compares and unifies three independently developed graph schema languages: SHACL, ShEx, and PG-Schema. It gives formal definitions of each and uses a uniform framework to identify their common functionality and their differences, clarifying where their features overlap and where they diverge.

Graphs have emerged as an important foundation for a variety of applications, including capturing and reasoning over factual knowledge, semantic data integration, social networks, and providing factual knowledge for machine learning algorithms. To formalise certain properties of the data and to ensure data quality, there is a need to describe the schema of such graphs. Because of the breadth of applications and availability of different data models, such as RDF and property graphs, both the Semantic Web and the database community have independently developed graph schema languages: SHACL, ShEx, and PG-Schema. Each language has its unique approach to defining constraints and validating graph data, leaving potential users in the dark about their commonalities and differences. In this paper, we provide formal, concise definitions of the core components of each of these schema languages. We employ a uniform framework to facilitate a comprehensive comparison between the languages and identify a common set of functionalities, shedding light on both overlapping and distinctive features of the three languages.

