CLSIJan 22

A Longitudinal, Multinational, and Multilingual Corpus of News Coverage of the Russo-Ukrainian War

arXiv:2601.16309v1h-index: 1
Originality Synthesis-oriented
AI Analysis

This provides a foundational resource for computational journalism and transnational discourse analysis of wartime information ecosystems.

The authors introduced DNIPRO, a longitudinal corpus of 246K multilingual news articles covering the Russo-Ukrainian war from 2022-2024, which enables systematic analysis of competing geopolitical narratives through experiments showing polarized interpretations across outlets.

We introduce DNIPRO, a novel longitudinal corpus of 246K news articles documenting the Russo-Ukrainian war from Feb 2022 to Aug 2024, spanning eleven media outlets across five nation states (Russia, Ukraine, U.S., U.K., and China) and three languages (English, Russian, and Mandarin Chinese). This multilingual resource features consistent and comprehensive metadata, and multiple types of annotation with rigorous human evaluations for downstream tasks relevant to systematic transnational analyses of contentious wartime discourse. DNIPRO's distinctive value lies in its inclusion of competing geopolitical perspectives, making it uniquely suited for studying narrative divergence, media framing, and information warfare. To demonstrate its utility, we include use case experiments using stance detection, sentiment analysis, topical framing, and contradiction analysis of major conflict events within the larger war. Our explorations reveal how outlets construct competing realities, with coverage exhibiting polarized interpretations that reflect geopolitical interests. Beyond supporting computational journalism research, DNIPRO provides a foundational resource for understanding how conflicting narratives emerge and evolve across global information ecosystems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes