IRDec 5, 2020

Aligning geographic entities from historical maps for building knowledge graphs

arXiv:2012.03069v143 citations
AI Analysis

This work provides a method for researchers and historians to more easily synthesize geographic information from multiple historical maps, which are often the only source of historical geographic data.

This paper addresses the challenge of aligning geographic entities across different historical maps to facilitate the creation of geographic knowledge graphs. The authors propose a workflow and evaluate methods, finding that a combination of string similarity, spatial distance, and approximate topological relation achieves the best performance with an average F-score of 0.89.

Historical maps contain rich geographic information about the past of a region. They are sometimes the only source of information before the availability of digital maps. Despite their valuable content, it is often challenging to access and use the information in historical maps, due to their forms of paper-based maps or scanned images. It is even more time-consuming and labor-intensive to conduct an analysis that requires a synthesis of the information from multiple historical maps. To facilitate the use of the geographic information contained in historical maps, one way is to build a geographic knowledge graph (GKG) from them. This paper proposes a general workflow for completing one important step of building such a GKG, namely aligning the same geographic entities from different maps. We present this workflow and the related methods for implementation, and systematically evaluate their performances using two different datasets of historical maps. The evaluation results show that machine learning and deep learning models for matching place names are sensitive to the thresholds learned from the training data, and a combination of measures based on string similarity, spatial distance, and approximate topological relation achieves the best performance with an average F-score of 0.89.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes