IRJul 9, 2020

Enhancing spatial and textual analysis with EUPEG: an extensible and unified platform for evaluating geoparsers

arXiv:2007.04524v142 citationsHas Code
AI Analysis

This provides a unified benchmarking tool for researchers in geographic information extraction, though it is incremental as it builds on existing corpora and methods.

The paper tackles the difficulty of comparing geoparsers due to inconsistent datasets and metrics by introducing EUPEG, an open-source web platform that hosts corpora, geoparsers, and metrics, enabling direct comparisons and reducing preparation time for researchers.

A rich amount of geographic information exists in unstructured texts, such as Web pages, social media posts, housing advertisements, and historical archives. Geoparsers are useful tools that extract structured geographic information from unstructured texts, thereby enabling spatial analysis on textual data. While a number of geoparsers were developed, they were tested on different datasets using different metrics. Consequently, it is difficult to compare existing geoparsers or to compare a new geoparser with existing ones. In recent years, researchers created open and annotated corpora for testing geoparsers. While these corpora are extremely valuable, much effort is still needed for a researcher to prepare these datasets and deploy geoparsers for comparative experiments. This paper presents EUPEG: an Extensible and Unified Platform for Evaluating Geoparsers. EUPEG is an open source and Web based benchmarking platform which hosts a majority of open corpora, geoparsers, and performance metrics reported in the literature. It enables direct comparison of the hosted geoparsers, and a new geoparser can be connected to EUPEG and compared with other geoparsers. The main objective of EUPEG is to reduce the time and effort that researchers have to spend in preparing datasets and baselines, thereby increasing the efficiency and effectiveness of comparative experiments.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes