CLAISIApr 1

Multimodal Analysis of State-Funded News Coverage of the Israel-Hamas War on YouTube Shorts

arXiv:2604.0099482.2
Predicted impact top 62% in CL · last 90 daysOriginality Synthesis-oriented
AI Analysis

This work addresses the limited research on short-form video news for humanities scholars, though it is incremental as it applies existing methods to a new data format.

The researchers tackled the problem of analyzing how geopolitical events are represented in YouTube Shorts by developing a multimodal pipeline to examine state-funded news coverage of the Israel-Hamas war, finding that sentiment varied across outlets and over time while visual cues aligned with real-world events, with smaller models outperforming large ones for sentiment analysis.

YouTube Shorts have become central to news consumption on the platform, yet research on how geopolitical events are represented in this format remains limited. To address this gap, we present a multimodal pipeline that combines automatic transcription, aspect-based sentiment analysis (ABSA), and semantic scene classification. The pipeline is first assessed for feasibility and then applied to analyze short-form coverage of the Israel-Hamas war by state-funded outlets. Using over 2,300 conflict-related Shorts and more than 94,000 visual frames, we systematically examine war reporting across major international broadcasters. Our findings reveal that the sentiment expressed in transcripts regarding specific aspects differs across outlets and over time, whereas scene-type classifications reflect visual cues consistent with real-world events. Notably, smaller domain-adapted models outperform large transformers and even LLMs for sentiment analysis, underscoring the value of resource-efficient approaches for humanities research. The pipeline serves as a template for other short-form platforms, such as TikTok and Instagram, and demonstrates how multimodal methods, combined with qualitative interpretation, can characterize sentiment patterns and visual cues in algorithmically driven video environments.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes