CLSISep 15, 2018

Geo-Text Data and Data-Driven Geospatial Semantics

arXiv:1809.05636v152 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the processing and utilization challenges of geo-text data for researchers in geospatial semantics, but it is incremental as it primarily reviews existing studies.

The paper reviews geo-text data, which links geographic locations to natural language texts, and systematically analyzes studies that extract knowledge from it, presenting a generalized workflow and future challenges.

Many datasets nowadays contain links between geographic locations and natural language texts. These links can be geotags, such as geotagged tweets or geotagged Wikipedia pages, in which location coordinates are explicitly attached to texts. These links can also be place mentions, such as those in news articles, travel blogs, or historical archives, in which texts are implicitly connected to the mentioned places. This kind of data is referred to as geo-text data. The availability of large amounts of geo-text data brings both challenges and opportunities. On the one hand, it is challenging to automatically process this kind of data due to the unstructured texts and the complex spatial footprints of some places. On the other hand, geo-text data offers unique research opportunities through the rich information contained in texts and the special links between texts and geography. As a result, geo-text data facilitates various studies especially those in data-driven geospatial semantics. This paper discusses geo-text data and related concepts. With a focus on data-driven research, this paper systematically reviews a large number of studies that have discovered multiple types of knowledge from geo-text data. Based on the literature review, a generalized workflow is extracted and key challenges for future work are discussed.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes