LGAICVFeb 16, 2021

An AutoML-based Approach to Multimodal Image Sentiment Analysis

arXiv:2102.08092v130 citations
AI Analysis

This addresses the challenge of effectively fusing multimodal data for sentiment analysis in social media applications, though it is incremental as it builds on existing AutoML techniques.

The paper tackled the problem of multimodal sentiment analysis by combining textual and image data using an AutoML-based approach, achieving state-of-the-art performance with 95.19% accuracy on the B-T4SA dataset.

Sentiment analysis is a research topic focused on analysing data to extract information related to the sentiment that it causes. Applications of sentiment analysis are wide, ranging from recommendation systems, and marketing to customer satisfaction. Recent approaches evaluate textual content using Machine Learning techniques that are trained over large corpora. However, as social media grown, other data types emerged in large quantities, such as images. Sentiment analysis in images has shown to be a valuable complement to textual data since it enables the inference of the underlying message polarity by creating context and connections. Multimodal sentiment analysis approaches intend to leverage information of both textual and image content to perform an evaluation. Despite recent advances, current solutions still flounder in combining both image and textual information to classify social media data, mainly due to subjectivity, inter-class homogeneity and fusion data differences. In this paper, we propose a method that combines both textual and image individual sentiment analysis into a final fused classification based on AutoML, that performs a random search to find the best model. Our method achieved state-of-the-art performance in the B-T4SA dataset, with 95.19% accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes