CL LG MLJul 11, 2018

Seq2Seq2Sentiment: Multimodal Sequence to Sequence Models for Sentiment Analysis

Hai Pham, Thomas Manzini, Paul Pu Liang, Barnabas Poczos

arXiv:1807.03915v232.11102 citations

Originality Incremental advance

AI Analysis

This work addresses sentiment analysis for multimodal data, offering incremental improvements in representation learning.

The paper tackles multimodal sentiment analysis by proposing unsupervised Seq2Seq models for learning joint representations, achieving a 12-point F1 score improvement over baselines on the CMU-MOSI dataset in bimodal cases.

Multimodal machine learning is a core research area spanning the language, visual and acoustic modalities. The central challenge in multimodal learning involves learning representations that can process and relate information from multiple modalities. In this paper, we propose two methods for unsupervised learning of joint multimodal representations using sequence to sequence (Seq2Seq) methods: a \textit{Seq2Seq Modality Translation Model} and a \textit{Hierarchical Seq2Seq Modality Translation Model}. We also explore multiple different variations on the multimodal inputs and outputs of these seq2seq models. Our experiments on multimodal sentiment analysis using the CMU-MOSI dataset indicate that our methods learn informative multimodal representations that outperform the baselines and achieve improved performance on multimodal sentiment analysis, specifically in the Bimodal case where our model is able to improve F1 Score by twelve points. We also discuss future directions for multimodal Seq2Seq methods.

View on arXiv PDF

Similar