SIHCLGMay 16, 2013

Inferring the Origin Locations of Tweets with Quantitative Confidence

arXiv:1305.3932v3125 citations
Originality Incremental advance
AI Analysis

This work addresses the need for accurate tweet location estimation to enhance utility in domains like public health and disaster management, though it is incremental as it builds on existing methods with improvements in scalability and uncertainty quantification.

The authors tackled the problem of missing geographic information in tweets, where fewer than 1.6% contain geotags, by proposing a scalable content-based approach using a novel Gaussian mixture model variant, achieving reliable and well-calibrated results competitive with previous methods on 13 million global tweets.

Social Internet content plays an increasingly critical role in many domains, including public health, disaster management, and politics. However, its utility is limited by missing geographic information; for example, fewer than 1.6% of Twitter messages (tweets) contain a geotag. We propose a scalable, content-based approach to estimate the location of tweets using a novel yet simple variant of gaussian mixture models. Further, because real-world applications depend on quantified uncertainty for such estimates, we propose novel metrics of accuracy, precision, and calibration, and we evaluate our approach accordingly. Experiments on 13 million global, comprehensively multi-lingual tweets show that our approach yields reliable, well-calibrated results competitive with previous computationally intensive methods. We also show that a relatively small number of training data are required for good estimates (roughly 30,000 tweets) and models are quite time-invariant (effective on tweets many weeks newer than the training set). Finally, we show that toponyms and languages with small geographic footprint provide the most useful location signals.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes