CL CYMar 7, 2014

Finding Eyewitness Tweets During Crises

Fred Morstatter, Nichola Lubold, Heather Pon-Barry, Jürgen Pfeffer, Huan Liu

arXiv:1403.1773v154 citations

Originality Synthesis-oriented

AI Analysis

This addresses a critical bottleneck for disaster response agencies in accessing timely, location-specific information from social media, though it is incremental in applying existing NLP techniques to a new domain.

The paper tackled the problem of identifying non-geotagged tweets from within crisis regions to aid disaster response, by analyzing linguistic differences and developing a real-time classification method, achieving results that enable automatic identification.

Disaster response agencies have started to incorporate social media as a source of fast-breaking information to understand the needs of people affected by the many crises that occur around the world. These agencies look for tweets from within the region affected by the crisis to get the latest updates of the status of the affected region. However only 1% of all tweets are geotagged with explicit location information. First responders lose valuable information because they cannot assess the origin of many of the tweets they collect. In this work we seek to identify non-geotagged tweets that originate from within the crisis region. Towards this, we address three questions: (1) is there a difference between the language of tweets originating within a crisis region and tweets originating outside the region, (2) what are the linguistic patterns that can be used to differentiate within-region and outside-region tweets, and (3) for non-geotagged tweets, can we automatically identify those originating within the crisis region in real-time?

View on arXiv PDF

Similar