CL IR LGApr 15, 2021

Detecting Polarized Topics Using Partisanship-aware Contextualized Topic Embeddings

Zihao He, Negar Mokhberian, Antonio Camara, Andres Abeliuk, Kristina Lerman

arXiv:2104.07814v230.7663 citationsHas Code

Originality Incremental advance

AI Analysis

This addresses the urgent need for early identification of polarized topics to mitigate conflict, though it is incremental as it builds on existing language models and polarization measurement techniques.

The authors tackled the problem of measuring topic-wise polarization in news media by proposing Partisanship-aware Contextualized Topic Embeddings (PaCTE), which automatically detects polarized topics from partisan news sources and demonstrated its efficacy in retrieving the most polarized topics on a COVID-19 dataset.

Growing polarization of the news media has been blamed for fanning disagreement, controversy and even violence. Early identification of polarized topics is thus an urgent matter that can help mitigate conflict. However, accurate measurement of topic-wise polarization is still an open research challenge. To address this gap, we propose Partisanship-aware Contextualized Topic Embeddings (PaCTE), a method to automatically detect polarized topics from partisan news sources. Specifically, utilizing a language model that has been finetuned on recognizing partisanship of the news articles, we represent the ideology of a news corpus on a topic by corpus-contextualized topic embedding and measure the polarization using cosine distance. We apply our method to a dataset of news articles about the COVID-19 pandemic. Extensive experiments on different news sources and topics demonstrate the efficacy of our method to capture topical polarization, as indicated by its effectiveness of retrieving the most polarized topics.

View on arXiv PDF Code

Similar