CVJun 11, 2023

REACT2023: the first Multi-modal Multiple Appropriate Facial Reaction Generation Challenge

arXiv:2306.06583v112 citationsh-index: 70Has Code
Originality Synthesis-oriented
AI Analysis

This provides a foundational benchmark for researchers in affective computing to compare methods for facial reaction generation, though it is incremental as it builds on existing multimedia processing techniques.

The REACT2023 challenge tackled the problem of generating appropriate facial reactions in dyadic interactions by creating the first benchmark test set for multi-modal processing, with baseline systems evaluated on offline and online sub-challenges.

The Multi-modal Multiple Appropriate Facial Reaction Generation Challenge (REACT2023) is the first competition event focused on evaluating multimedia processing and machine learning techniques for generating human-appropriate facial reactions in various dyadic interaction scenarios, with all participants competing strictly under the same conditions. The goal of the challenge is to provide the first benchmark test set for multi-modal information processing and to foster collaboration among the audio, visual, and audio-visual affective computing communities, to compare the relative merits of the approaches to automatic appropriate facial reaction generation under different spontaneous dyadic interaction conditions. This paper presents: (i) novelties, contributions and guidelines of the REACT2023 challenge; (ii) the dataset utilized in the challenge; and (iii) the performance of baseline systems on the two proposed sub-challenges: Offline Multiple Appropriate Facial Reaction Generation and Online Multiple Appropriate Facial Reaction Generation, respectively. The challenge baseline code is publicly available at \url{https://github.com/reactmultimodalchallenge/baseline_react2023}.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes