CYCLLGSep 18, 2021

A Machine Learning Pipeline to Examine Political Bias with Congressional Speeches

arXiv:2109.09014v15 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of reducing human-intensive labeling for political bias analysis in social media, though it is incremental as it builds on existing machine learning pipelines.

The authors tackled the problem of predicting political bias in social media without human-annotated data by using transcripts from US congressional speeches to label data, achieving accuracies of 70.5% on Twitter and 65.1% on Gab, and forecasting cascade bias with about 85% accuracy.

Computational methods to model political bias in social media involve several challenges due to heterogeneity, high-dimensional, multiple modalities, and the scale of the data. Political bias in social media has been studied in multiple viewpoints like media bias, political ideology, echo chambers, and controversies using machine learning pipelines. Most of the current methods rely heavily on the manually-labeled ground-truth data for the underlying political bias prediction tasks. Limitations of such methods include human-intensive labeling, labels related to only a specific problem, and the inability to determine the near future bias state of a social media conversation. In this work, we address such problems and give machine learning approaches to study political bias in two ideologically diverse social media forums: Gab and Twitter without the availability of human-annotated data. Our proposed methods exploit the use of transcripts collected from political speeches in US congress to label the data and achieve the highest accuracy of 70.5% and 65.1% in Twitter and Gab data respectively to predict political bias. We also present a machine learning approach that combines features from cascades and text to forecast cascade's political bias with an accuracy of about 85%.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes