CVOct 3, 2018

SAVOIAS: A Diverse, Multi-Category Visual Complexity Dataset

arXiv:1810.01771v14.611 citationsHas Code

Originality Synthesis-oriented

AI Analysis

This dataset addresses a gap for researchers in cognitive psychology, computer vision, and visualization by enabling category-specific method development, though it is incremental as it builds on existing data collection efforts.

The authors tackled the lack of diverse, unbiased datasets for visual complexity analysis by introducing Savoias, a dataset of over 1,400 images across seven categories with crowdsourced ground truth, and found that state-of-the-art algorithms correlate poorly with human labels in most categories (e.g., r=0.3 for objects).

Visual complexity identifies the level of intricacy and details in an image or the level of difficulty to describe the image. It is an important concept in a variety of areas such as cognitive psychology, computer vision and visualization, and advertisement. Yet, efforts to create large, downloadable image datasets with diverse content and unbiased groundtruthing are lacking. In this work, we introduce Savoias, a visual complexity dataset that compromises of more than 1,400 images from seven image categories relevant to the above research areas, namely Scenes, Advertisements, Visualization and infographics, Objects, Interior design, Art, and Suprematism. The images in each category portray diverse characteristics including various low-level and high-level features, objects, backgrounds, textures and patterns, text, and graphics. The ground truth for Savoias is obtained by crowdsourcing more than 37,000 pairwise comparisons of images using the forced-choice methodology and with more than 1,600 contributors. The resulting relative scores are then converted to absolute visual complexity scores using the Bradley-Terry method and matrix completion. When applying five state-of-the-art algorithms to analyze the visual complexity of the images in the Savoias dataset, we found that the scores obtained from these baseline tools only correlate well with crowdsourced labels for abstract patterns in the Suprematism category (Pearson correlation r=0.84). For the other categories, in particular, the objects and advertisement categories, low correlation coefficients were revealed (r=0.3 and 0.56, respectively). These findings suggest that (1) state-of-the-art approaches are mostly insufficient and (2) Savoias enables category-specific method development, which is likely to improve the impact of visual complexity analysis on specific application areas, including computer vision.

View on arXiv PDF Code

Similar