CVAIMMNov 25, 2020

Emotional Semantics-Preserved and Feature-Aligned CycleGAN for Visual Emotion Adaptation

arXiv:2011.12470v13 citations
AI Analysis

This research provides an incremental improvement for researchers and practitioners working on visual emotion analysis, particularly in scenarios with domain shifts and limited labeled data in target domains.

This paper addresses unsupervised domain adaptation (UDA) in visual emotion analysis, focusing on emotion distribution learning and dominant emotion classification. The authors developed CycleEmotionGAN++, an end-to-end cycle-consistent adversarial model, which significantly improved performance compared to state-of-the-art UDA approaches on Flickr-LDL & Twitter-LDL and ArtPhoto & FI datasets.

Thanks to large-scale labeled training data, deep neural networks (DNNs) have obtained remarkable success in many vision and multimedia tasks. However, because of the presence of domain shift, the learned knowledge of the well-trained DNNs cannot be well generalized to new domains or datasets that have few labels. Unsupervised domain adaptation (UDA) studies the problem of transferring models trained on one labeled source domain to another unlabeled target domain. In this paper, we focus on UDA in visual emotion analysis for both emotion distribution learning and dominant emotion classification. Specifically, we design a novel end-to-end cycle-consistent adversarial model, termed CycleEmotionGAN++. First, we generate an adapted domain to align the source and target domains on the pixel-level by improving CycleGAN with a multi-scale structured cycle-consistency loss. During the image translation, we propose a dynamic emotional semantic consistency loss to preserve the emotion labels of the source images. Second, we train a transferable task classifier on the adapted domain with feature-level alignment between the adapted and target domains. We conduct extensive UDA experiments on the Flickr-LDL & Twitter-LDL datasets for distribution learning and ArtPhoto & FI datasets for emotion classification. The results demonstrate the significant improvements yielded by the proposed CycleEmotionGAN++ as compared to state-of-the-art UDA approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes