CVApr 18, 2018

Falling Things: A Synthetic Dataset for 3D Object Detection and Pose Estimation

arXiv:1804.06534v2210 citations
Originality Synthesis-oriented
AI Analysis

This dataset addresses the data scarcity issue for researchers in robotics and computer vision, though it is incremental as it builds on existing object models and synthetic generation techniques.

The authors tackled the problem of limited annotated data for 3D object detection and pose estimation in robotics by creating a synthetic dataset called Falling Things (FAT), which includes 60k annotated photos of 21 household objects with accurate 3D poses and other annotations.

We present a new dataset, called Falling Things (FAT), for advancing the state-of-the-art in object detection and 3D pose estimation in the context of robotics. By synthetically combining object models and backgrounds of complex composition and high graphical quality, we are able to generate photorealistic images with accurate 3D pose annotations for all objects in all images. Our dataset contains 60k annotated photos of 21 household objects taken from the YCB dataset. For each image, we provide the 3D poses, per-pixel class segmentation, and 2D/3D bounding box coordinates for all objects. To facilitate testing different input modalities, we provide mono and stereo RGB images, along with registered dense depth images. We describe in detail the generation process and statistical analysis of the data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes