CLJul 4, 2022

Location reference recognition from texts: A survey and comparison

arXiv:2207.01683v186 citationsh-index: 59
Originality Synthesis-oriented
AI Analysis

It provides a systematic overview for researchers and practitioners in fields like disaster management and geographic information retrieval, but it is incremental as it consolidates existing knowledge rather than introducing new methods.

This survey addresses the lack of a comprehensive review and comparison of approaches for location reference recognition in geoparsing, summarizing seven application domains and evaluating 27 methods on 26 datasets with 39,736 location references to guide future developments and selection.

A vast amount of location information exists in unstructured texts, such as social media posts, news stories, scientific articles, web pages, travel blogs, and historical archives. Geoparsing refers to the process of recognizing location references from texts and identifying their geospatial representations. While geoparsing can benefit many domains, a summary of the specific applications is still missing. Further, there lacks a comprehensive review and comparison of existing approaches for location reference recognition, which is the first and a core step of geoparsing. To fill these research gaps, this review first summarizes seven typical application domains of geoparsing: geographic information retrieval, disaster management, disease surveillance, traffic management, spatial humanities, tourism management, and crime management. We then review existing approaches for location reference recognition by categorizing these approaches into four groups based on their underlying functional principle: rule-based, gazetteer matching-based, statistical learning-based, and hybrid approaches. Next, we thoroughly evaluate the correctness and computational efficiency of the 27 most widely used approaches for location reference recognition based on 26 public datasets with different types of texts (e.g., social media posts and news stories) containing 39,736 location references across the world. Results from this thorough evaluation can help inform future methodological developments for location reference recognition, and can help guide the selection of proper approaches based on application needs.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes