CLLGAug 28, 2020

TATL at W-NUT 2020 Task 2: A Transformer-based Baseline System for Identification of Informative COVID-19 English Tweets

arXiv:2008.12854v1
Originality Synthesis-oriented
AI Analysis

This work addresses the need for filtering uninformative COVID-19 tweets for downstream applications, but it is incremental as it applies an existing method to a new dataset.

The paper tackled the problem of automatically identifying informative COVID-19 English tweets from social media, achieving competitive results by ranking 8th out of 56 teams in a shared task.

As the COVID-19 outbreak continues to spread throughout the world, more and more information about the pandemic has been shared publicly on social media. For example, there are a huge number of COVID-19 English Tweets daily on Twitter. However, the majority of those Tweets are uninformative, and hence it is important to be able to automatically select only the informative ones for downstream applications. In this short paper, we present our participation in the W-NUT 2020 Shared Task 2: Identification of Informative COVID-19 English Tweets. Inspired by the recent advances in pretrained Transformer language models, we propose a simple yet effective baseline for the task. Despite its simplicity, our proposed approach shows very competitive results in the leaderboard as we ranked 8 over 56 teams participated in total.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes