CVAIMar 18, 2024

ReGenNet: Towards Human Action-Reaction Synthesis

arXiv:2403.11882v159 citationsh-index: 25CVPR
Originality Incremental advance
AI Analysis

This work addresses the need for realistic human-human interaction synthesis in applications like animation and virtual reality, but it is incremental as it builds on existing datasets and generative methods.

The paper tackles the problem of synthesizing dynamic human reactions to given human actions, which is less explored compared to static human-scene interactions, and proposes ReGenNet, a diffusion-based generative model that generates plausible human reactions with quantitative improvements over baselines.

Humans constantly interact with their surrounding environments. Current human-centric generative models mainly focus on synthesizing humans plausibly interacting with static scenes and objects, while the dynamic human action-reaction synthesis for ubiquitous causal human-human interactions is less explored. Human-human interactions can be regarded as asymmetric with actors and reactors in atomic interaction periods. In this paper, we comprehensively analyze the asymmetric, dynamic, synchronous, and detailed nature of human-human interactions and propose the first multi-setting human action-reaction synthesis benchmark to generate human reactions conditioned on given human actions. To begin with, we propose to annotate the actor-reactor order of the interaction sequences for the NTU120, InterHuman, and Chi3D datasets. Based on them, a diffusion-based generative model with a Transformer decoder architecture called ReGenNet together with an explicit distance-based interaction loss is proposed to predict human reactions in an online manner, where the future states of actors are unavailable to reactors. Quantitative and qualitative results show that our method can generate instant and plausible human reactions compared to the baselines, and can generalize to unseen actor motions and viewpoint changes.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes