Columbia MVSO Image Sentiment Dataset
This provides a new benchmark for researchers working on image sentiment analysis, though it is incremental as it builds upon existing multilingual data.
The researchers tackled the problem of evaluating automatic sentiment prediction in images by creating a benchmark dataset, resulting in a collection of 3,911 English Adjective-Noun Pairs with human-labeled sentiments from crowd-sourced judgments.
The Multilingual Visual Sentiment Ontology (MVSO) consists of 15,600 concepts in 12 different languages that are strongly related to emotions and sentiments expressed in images. These concepts are defined in the form of Adjective-Noun Pair (ANP), which are crawled and discovered from online image forum Flickr. In this work, we used Amazon Mechanical Turk as a crowd-sourcing platform to collect human judgments on sentiments expressed in images that are uniformly sampled over 3,911 English ANPs extracted from a tag-restricted subset of MVSO. Our goal is to use the dataset as a benchmark for the evaluation of systems that automatically predict sentiments in images or ANPs.