HC | AI | LG | Aug 25, 2022

Supervised Contrastive Learning for Affect Modelling

arXiv:2208.12238v1 | 15 citations | h-index: 59
Originality: Incremental advance
AI Analysis

The paper addresses affect modeling for multimodal user input, but its contribution is incremental: it applies an existing contrastive learning paradigm to a specific domain.

The paper tackles affect modeling by training subject-invariant representations with supervised contrastive learning, which improves arousal prediction accuracy on the RECOLA dataset compared to end-to-end methods.

Affect modeling is viewed, traditionally, as the process of mapping measurable affect manifestations from multiple modalities of user input to affect labels. That mapping is usually inferred through end-to-end (manifestation-to-affect) machine learning processes. What if, instead, one trains general, subject-invariant representations that consider affect information and then uses such representations to model affect? In this paper we assume that affect labels form an integral part, and not just the training signal, of an affect representation and we explore how the recent paradigm of contrastive learning can be employed to discover general high-level affect-infused representations for the purpose of modeling affect. We introduce three different supervised contrastive learning approaches for training representations that consider affect information. In this initial study we test the proposed methods for arousal prediction in the RECOLA dataset based on user information from multiple modalities. Results demonstrate the representation capacity of contrastive learning and its efficiency in boosting the accuracy of affect models. Beyond their evidenced higher performance compared to end-to-end arousal classification, the resulting representations are general-purpose and subject-agnostic, as training is guided through general affect information available in any multimodal corpus.
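The abstract does not spell out the three proposed approaches, so the sketch below only illustrates the generic supervised contrastive objective that such methods typically build on: embeddings of samples sharing a label are pulled together, and all others are pushed apart. It should not be read as the authors' implementation. The function name, the temperature value, and the use of discrete arousal classes as labels are all assumptions made for illustration.

```python
import numpy as np

def supervised_contrastive_loss(embeddings, labels, temperature=0.1):
    """Generic supervised contrastive loss over one batch (illustrative only).

    embeddings: (n, d) array of non-zero representation vectors.
    labels: (n,) array of class labels, e.g. hypothetical arousal classes.
    """
    z = np.asarray(embeddings, dtype=np.float64)
    labels = np.asarray(labels)
    n = z.shape[0]

    # L2-normalise so that dot products are cosine similarities.
    z = z / np.linalg.norm(z, axis=1, keepdims=True)
    sim = z @ z.T / temperature

    # Every sample is compared against all others, never against itself.
    not_self = ~np.eye(n, dtype=bool)

    # Log-softmax over the other samples (row max subtracted for stability).
    sim = sim - sim.max(axis=1, keepdims=True)
    exp_sim = np.exp(sim) * not_self
    log_prob = sim - np.log(exp_sim.sum(axis=1, keepdims=True))

    # Positives: other samples carrying the same label as the anchor.
    positives = (labels[:, None] == labels[None, :]) & not_self
    n_pos = positives.sum(axis=1)
    has_pos = n_pos > 0
    if not has_pos.any():
        return 0.0  # no anchor has a positive pair in this batch

    # Average log-probability of the positives, per anchor that has any.
    mean_log_prob_pos = (log_prob * positives).sum(axis=1)[has_pos] / n_pos[has_pos]
    return float(-mean_log_prob_pos.mean())


# Toy batch: two tight clusters in a 2-D embedding space.
emb = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]])
loss_aligned = supervised_contrastive_loss(emb, [0, 0, 1, 1])
loss_misaligned = supervised_contrastive_loss(emb, [0, 1, 0, 1])
print(loss_aligned < loss_misaligned)
```

In the toy batch, labels that agree with the geometric clusters give a lower loss than labels that cut across them, which is the property a representation trained this way is pushed toward. A downstream arousal classifier would then be fitted on the learned embeddings, not on the raw multimodal features.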

Code Implementations: 1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.
