CLCVIRNov 21, 2017

Visual and Textual Sentiment Analysis Using Deep Fusion Convolutional Neural Networks

arXiv:1711.07798v16 citations
Originality Incremental advance
AI Analysis

This addresses sentiment analysis for social media applications, but it is incremental as it builds on existing multimodal fusion approaches.

The paper tackles sentiment analysis by proposing a deep fusion convolutional neural network that jointly learns from visual and textual data, achieving promising results compared to state-of-the-art methods on two datasets.

Sentiment analysis is attracting more and more attentions and has become a very hot research topic due to its potential applications in personalized recommendation, opinion mining, etc. Most of the existing methods are based on either textual or visual data and can not achieve satisfactory results, as it is very hard to extract sufficient information from only one single modality data. Inspired by the observation that there exists strong semantic correlation between visual and textual data in social medias, we propose an end-to-end deep fusion convolutional neural network to jointly learn textual and visual sentiment representations from training examples. The two modality information are fused together in a pooling layer and fed into fully-connected layers to predict the sentiment polarity. We evaluate the proposed approach on two widely used data sets. Results show that our method achieves promising result compared with the state-of-the-art methods which clearly demonstrate its competency.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes