IRLGMLJun 9, 2018

DIR-ST$^2$: Delineation of Imprecise Regions Using Spatio--Temporal--Textual Information

arXiv:1806.03482v11 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of accurately defining vague geographical areas for applications like urban planning or social media analysis, though it appears incremental as it builds on existing clustering methods with added temporal data.

The paper tackles the problem of delineating imprecise geographical regions by proposing DIR-ST$^2$, a framework that uses spatio-temporal-textual information from social media to iteratively perform density-based clustering, resulting in significant noise reduction and outperforming a state-of-the-art method with improved $\mathcal{F}_1$ scores.

An imprecise region is referred to as a geographical area without a clearly-defined boundary in the literature. Previous clustering-based approaches exploit spatial information to find such regions. However, the prior studies suffer from the following two problems: the subjectivity in selecting clustering parameters and the inclusion of a large portion of the undesirable region (i.e., a large number of noise points). To overcome these problems, we present DIR-ST$^2$, a novel framework for delineating an imprecise region by iteratively performing density-based clustering, namely DBSCAN, along with not only spatio--textual information but also temporal information on social media. Specifically, we aim at finding a proper radius of a circle used in the iterative DBSCAN process by gradually reducing the radius for each iteration in which the temporal information acquired from all resulting clusters are leveraged. Then, we propose an efficient and automated algorithm delineating the imprecise region via hierarchical clustering. Experiment results show that by virtue of the significant noise reduction in the region, our DIR-ST$^2$ method outperforms the state-of-the-art approach employing one-class support vector machine in terms of the $\mathcal{F}_1$ score from comparison with precisely-defined regions regarded as a ground truth, and returns apparently better delineation of imprecise regions. The computational complexity of DIR-ST$^2$ is also analytically and numerically shown.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes