CV · Jul 5, 2023

MRecGen: Multimodal Appropriate Reaction Generator

arXiv:2307.02609v1 · 3 citations · h-index: 32 · Has Code
Originality: Incremental advance
AI Analysis

The framework enables more natural human-computer interaction by generating appropriate behaviors for virtual agents or robots.

The paper tackles the challenge of generating multiple appropriate verbal and non-verbal human reactions to a given behavior. It presents a framework that produces synchronized text, audio, and video streams as realistic human-style responses.

Verbal and non-verbal human reaction generation is a challenging task, as different reactions could be appropriate for responding to the same behaviour. This paper proposes the first multiple and multimodal (verbal and nonverbal) appropriate human reaction generation framework that can generate appropriate and realistic human-style reactions (displayed in the form of synchronised text, audio and video streams) in response to an input user behaviour. This novel technique can be applied to various human-computer interaction scenarios by generating appropriate virtual agent/robot behaviours. Our demo is available at https://github.com/SSYSteve/MRecGen.

