CLIRLGApr 15, 2021

Detecting Polarized Topics Using Partisanship-aware Contextualized Topic Embeddings

arXiv:2104.07814v2663 citations
Originality Incremental advance
AI Analysis

This addresses the urgent need for early identification of polarized topics to mitigate conflict, though it is incremental as it builds on existing language models and polarization measurement techniques.

The authors tackled the problem of measuring topic-wise polarization in news media by proposing Partisanship-aware Contextualized Topic Embeddings (PaCTE), which automatically detects polarized topics from partisan news sources and demonstrated its efficacy in retrieving the most polarized topics on a COVID-19 dataset.

Growing polarization of the news media has been blamed for fanning disagreement, controversy and even violence. Early identification of polarized topics is thus an urgent matter that can help mitigate conflict. However, accurate measurement of topic-wise polarization is still an open research challenge. To address this gap, we propose Partisanship-aware Contextualized Topic Embeddings (PaCTE), a method to automatically detect polarized topics from partisan news sources. Specifically, utilizing a language model that has been finetuned on recognizing partisanship of the news articles, we represent the ideology of a news corpus on a topic by corpus-contextualized topic embedding and measure the polarization using cosine distance. We apply our method to a dataset of news articles about the COVID-19 pandemic. Extensive experiments on different news sources and topics demonstrate the efficacy of our method to capture topical polarization, as indicated by its effectiveness of retrieving the most polarized topics.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes