CVDec 18, 2020

Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild with Pose Annotations

arXiv:2012.09988v1226 citations
AI Analysis

This dataset addresses the lack of large-scale, real-world 3D object detection data for researchers and developers in robotics, augmented reality, and autonomy.

This paper introduces Objectron, a large-scale dataset of object-centric short videos with pose annotations for nine categories. It contains 4 million annotated images across 14,819 videos, aiming to advance 3D object detection and related research.

3D object detection has recently become popular due to many applications in robotics, augmented reality, autonomy, and image retrieval. We introduce the Objectron dataset to advance the state of the art in 3D object detection and foster new research and applications, such as 3D object tracking, view synthesis, and improved 3D shape representation. The dataset contains object-centric short videos with pose annotations for nine categories and includes 4 million annotated images in 14,819 annotated videos. We also propose a new evaluation metric, 3D Intersection over Union, for 3D object detection. We demonstrate the usefulness of our dataset in 3D object detection tasks by providing baseline models trained on this dataset. Our dataset and evaluation source code are available online at http://www.objectron.dev

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes