Embody 3D: A Large-scale Multimodal Motion and Behavior Dataset
This dataset provides a valuable resource for researchers in computer vision, robotics, and human-computer interaction by offering extensive single-person and multi-person behavioral data, though it is incremental as it builds on existing motion capture datasets.
The researchers introduced Embody 3D, a large-scale multimodal dataset comprising 500 hours of 3D motion data from 439 participants, totaling over 54 million frames, to address the need for diverse human motion and behavior data in AI applications.
The Codec Avatars Lab at Meta introduces Embody 3D, a multimodal dataset of 500 individual hours of 3D motion data from 439 participants collected in a multi-camera collection stage, amounting to over 54 million frames of tracked 3D motion. The dataset features a wide range of single-person motion data, including prompted motions, hand gestures, and locomotion; as well as multi-person behavioral and conversational data like discussions, conversations in different emotional states, collaborative activities, and co-living scenarios in an apartment-like space. We provide tracked human motion including hand tracking and body shape, text annotations, and a separate audio track for each participant.