CVJan 10, 2024

REACT 2024: the Second Multiple Appropriate Facial Reaction Generation Challenge

arXiv:2401.05166v124 citationsh-index: 34Has CodeFG
Originality Synthesis-oriented
AI Analysis

This work addresses the challenge of realistic non-verbal communication in AI for video conferencing scenarios, but it is incremental as it builds on a previous edition without introducing new methods.

The paper presents the REACT 2024 challenge, which tackles the problem of generating multiple appropriate, diverse, and synchronized facial reactions from unseen speaker behaviors in dyadic interactions, using a dataset from NOXI and RECOLA with baseline systems achieving performance benchmarks for offline and online generation tasks.

In dyadic interactions, humans communicate their intentions and state of mind using verbal and non-verbal cues, where multiple different facial reactions might be appropriate in response to a specific speaker behaviour. Then, how to develop a machine learning (ML) model that can automatically generate multiple appropriate, diverse, realistic and synchronised human facial reactions from an previously unseen speaker behaviour is a challenging task. Following the successful organisation of the first REACT challenge (REACT 2023), this edition of the challenge (REACT 2024) employs a subset used by the previous challenge, which contains segmented 30-secs dyadic interaction clips originally recorded as part of the NOXI and RECOLA datasets, encouraging participants to develop and benchmark Machine Learning (ML) models that can generate multiple appropriate facial reactions (including facial image sequences and their attributes) given an input conversational partner's stimulus under various dyadic video conference scenarios. This paper presents: (i) the guidelines of the REACT 2024 challenge; (ii) the dataset utilized in the challenge; and (iii) the performance of the baseline systems on the two proposed sub-challenges: Offline Multiple Appropriate Facial Reaction Generation and Online Multiple Appropriate Facial Reaction Generation, respectively. The challenge baseline code is publicly available at https://github.com/reactmultimodalchallenge/baseline_react2024.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes