CL MMJun 20, 2016

MOSI: Multimodal Corpus of Sentiment Intensity and Subjectivity Analysis in Online Opinion Videos

Amir Zadeh, Rowan Zellers, Eli Pincus, Louis-Philippe Morency

arXiv:1606.06259v229.6641 citations

Originality Synthesis-oriented

AI Analysis

This addresses the problem of understudied sentiment analysis in videos for researchers and industry, though it is incremental as it focuses on dataset creation and initial methods.

The paper tackles the lack of datasets and methods for sentiment analysis in online videos by introducing MOSI, the first opinion-level annotated corpus for sentiment intensity and subjectivity, and presents baselines and a multimodal fusion approach.

People are sharing their opinions, stories and reviews through online video sharing websites every day. Studying sentiment and subjectivity in these opinion videos is experiencing a growing attention from academia and industry. While sentiment analysis has been successful for text, it is an understudied research question for videos and multimedia content. The biggest setbacks for studies in this direction are lack of a proper dataset, methodology, baselines and statistical analysis of how information from different modality sources relate to each other. This paper introduces to the scientific community the first opinion-level annotated corpus of sentiment and subjectivity analysis in online videos called Multimodal Opinion-level Sentiment Intensity dataset (MOSI). The dataset is rigorously annotated with labels for subjectivity, sentiment intensity, per-frame and per-opinion annotated visual features, and per-milliseconds annotated audio features. Furthermore, we present baselines for future studies in this direction as well as a new multimodal fusion approach that jointly models spoken words and visual gestures.

View on arXiv PDF

Similar