DL · Apr 23

OpenCitations Meta

arXiv:2306.16191 · 61.712 citations · h-index: 36
AI Analysis

For researchers and bibliometricians, it provides the largest open bibliographic metadata source with enhanced performance, data integrity, and interoperability, though it is an incremental improvement over existing databases.

OpenCitations Meta is a new open bibliographic metadata database that integrates data from Crossref, DataCite, and PubMed, using Semantic Web technologies and assigning unique OMIDs for disambiguation. It eliminates reliance on external APIs, includes automated curation with deduplication and error correction, and provides superior interoperability via SPARQL, REST APIs, and data dumps.

OpenCitations Meta is a new database for open bibliographic metadata of scholarly publications involved in the citations indexed by the OpenCitations infrastructure, adhering to Open Science principles and published under a CC0 license to promote maximum reuse. It presently incorporates bibliographic metadata for publications recorded in Crossref, DataCite and PubMed, making it the largest bibliographic metadata source using Semantic Web technologies. It assigns new globally persistent identifiers (PIDs), known as OpenCitations Meta Identifiers (OMIDs), to all bibliographic resources, enabling it both to disambiguate publications described using different external PIDs (e.g., a DOI in Crossref and a PMID in PubMed), and to handle citations involving publications lacking external PIDs. By hosting bibliographic metadata internally, OpenCitations Meta eliminates its former reliance on API calls to external resources and thus enhances performance in response to user queries. Its automated data curation, following the OpenCitations Data Model, includes deduplication, error correction, metadata enrichment and full provenance tracking, ensuring transparency and traceability of data and bolstering confidence in data integrity, a feature unparalleled in other bibliographic databases. Its commitment to Semantic Web standards ensures superior interoperability compared to other machine-readable formats, with availability via a SPARQL endpoint, REST APIs and data dumps.
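
To make the disambiguation idea concrete, here is a minimal sketch of how records from different sources might be grouped when they share an external PID, with one internal identifier assigned per group. It is not the OpenCitations Meta implementation: the function name `merge_by_shared_ids`, the record layout, the `example:N` labels (which are not the real OMID scheme) and the sample DOIs and PMID are all assumptions made up for illustration.

```python
from collections import defaultdict


def merge_by_shared_ids(records):
    """Group records that share at least one external identifier.

    Hypothetical helper: uses a simple union-find over record indices.
    """
    parent = list(range(len(records)))

    def find(i):
        # Follow parent links to the group root, compressing the path.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    def union(i, j):
        ri, rj = find(i), find(j)
        if ri != rj:
            # Keep the smaller index as root so output order is stable.
            parent[max(ri, rj)] = min(ri, rj)

    first_seen = {}  # external PID -> index of first record carrying it
    for idx, rec in enumerate(records):
        for pid in rec["ids"]:
            if pid in first_seen:
                union(idx, first_seen[pid])
            else:
                first_seen[pid] = idx

    groups = defaultdict(list)
    for idx in range(len(records)):
        groups[find(idx)].append(idx)

    merged = []
    for n, (_, members) in enumerate(sorted(groups.items()), start=1):
        merged.append({
            # Placeholder label only; NOT the real OMID format.
            "internal_id": f"example:{n}",
            "ids": sorted({pid for m in members for pid in records[m]["ids"]}),
            "sources": [records[m]["source"] for m in members],
        })
    return merged


# Made-up sample records; the identifiers do not refer to real publications.
records = [
    {"source": "Crossref", "ids": ["doi:10.1234/example.1"]},
    {"source": "PubMed", "ids": ["pmid:00000001", "doi:10.1234/example.1"]},
    {"source": "DataCite", "ids": ["doi:10.1234/example.2"]},
    {"source": "citation-only", "ids": []},  # no external PID at all
]

merged = merge_by_shared_ids(records)
for entry in merged:
    print(entry)
```

In this sketch the Crossref and PubMed records collapse into one entry because they share a DOI, while the record with no external PID still receives its own internal identifier, which mirrors the two roles the abstract attributes to OMIDs.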

Code Implementations · 1 repo
