CL · Jan 8, 2025

PolInterviews -- A Dataset of German Politician Public Broadcast Interviews

arXiv:2501.04484v2
AI Analysis

The paper provides the first dataset of its kind for studying German political communication, though the contribution is incremental: it applies existing data collection methods to a new domain.

The authors introduce PolInterviews, a dataset of 99 public broadcast interviews with 33 German politicians, containing 28,146 sentences, to enable research on political communication topics such as agenda-setting and interviewer dynamics.

This paper presents a novel dataset of public broadcast interviews featuring high-ranking German politicians. The interviews were sourced from YouTube, transcribed, processed for speaker identification, and stored in a tidy and open format. The dataset comprises 99 interviews with 33 different German politicians across five major interview formats, containing a total of 28,146 sentences. As the first of its kind, this dataset offers valuable opportunities for research on various aspects of political communication in (German) political contexts, such as agenda-setting, interviewer dynamics, or politicians' self-presentation.

