MMCVOct 3, 2019

Affective Computing for Large-Scale Heterogeneous Multimedia Data: A Survey

arXiv:1911.05609v179 citations
Originality Synthesis-oriented
AI Analysis

It addresses the need for affective computing in multimedia data for applications like retrieval and human behavior understanding, but it is incremental as a survey paper.

This survey comprehensively reviews state-of-the-art affective computing technologies for managing and understanding large-scale heterogeneous multimedia data, covering emotion representation models, datasets, and methods across images, music, videos, and multimodal data.

The wide popularity of digital photography and social networks has generated a rapidly growing volume of multimedia data (i.e., image, music, and video), resulting in a great demand for managing, retrieving, and understanding these data. Affective computing (AC) of these data can help to understand human behaviors and enable wide applications. In this article, we survey the state-of-the-art AC technologies comprehensively for large-scale heterogeneous multimedia data. We begin this survey by introducing the typical emotion representation models from psychology that are widely employed in AC. We briefly describe the available datasets for evaluating AC algorithms. We then summarize and compare the representative methods on AC of different multimedia types, i.e., images, music, videos, and multimodal data, with the focus on both handcrafted features-based methods and deep learning methods. Finally, we discuss some challenges and future directions for multimedia affective computing.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes