CL LGAug 28, 2020

TATL at W-NUT 2020 Task 2: A Transformer-based Baseline System for Identification of Informative COVID-19 English Tweets

arXiv:2008.12854v10.2

Originality Synthesis-oriented

AI Analysis

This work addresses the need for filtering uninformative COVID-19 tweets for downstream applications, but it is incremental as it applies an existing method to a new dataset.

The paper tackled the problem of automatically identifying informative COVID-19 English tweets from social media, achieving competitive results by ranking 8th out of 56 teams in a shared task.

As the COVID-19 outbreak continues to spread throughout the world, more and more information about the pandemic has been shared publicly on social media. For example, there are a huge number of COVID-19 English Tweets daily on Twitter. However, the majority of those Tweets are uninformative, and hence it is important to be able to automatically select only the informative ones for downstream applications. In this short paper, we present our participation in the W-NUT 2020 Shared Task 2: Identification of Informative COVID-19 English Tweets. Inspired by the recent advances in pretrained Transformer language models, we propose a simple yet effective baseline for the task. Despite its simplicity, our proposed approach shows very competitive results in the leaderboard as we ranked 8 over 56 teams participated in total.

View on arXiv PDF

Similar