CVMar 27, 2023

Hi4D: 4D Instance Segmentation of Close Human Interaction

arXiv:2303.15380v199 citationsh-index: 42
Originality Incremental advance
AI Analysis

This addresses the challenge of disentangling in-contact subjects in computer vision, with incremental improvements in dataset creation and method optimization.

The authors tackled the problem of automatically segmenting and analyzing physically close human-human interactions in 4D, resulting in the Hi4D dataset with over 11K frames and state-of-the-art benchmarks for pose and shape estimation.

We propose Hi4D, a method and dataset for the automatic analysis of physically close human-human interaction under prolonged contact. Robustly disentangling several in-contact subjects is a challenging task due to occlusions and complex shapes. Hence, existing multi-view systems typically fuse 3D surfaces of close subjects into a single, connected mesh. To address this issue we leverage i) individually fitted neural implicit avatars; ii) an alternating optimization scheme that refines pose and surface through periods of close proximity; and iii) thus segment the fused raw scans into individual instances. From these instances we compile Hi4D dataset of 4D textured scans of 20 subject pairs, 100 sequences, and a total of more than 11K frames. Hi4D contains rich interaction-centric annotations in 2D and 3D alongside accurately registered parametric body models. We define varied human pose and shape estimation tasks on this dataset and provide results from state-of-the-art methods on these benchmarks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes