CVAIGRRODec 15, 2022

Objaverse: A Universe of Annotated 3D Objects

AI2
arXiv:2212.08051v11669 citationsh-index: 77
AI Analysis

This dataset addresses a gap for researchers in 3D vision and AI by providing a large-scale resource, though it is incremental as it builds on existing 3D repositories.

The authors tackled the lack of large-scale 3D datasets in AI by introducing Objaverse 1.0, a dataset with over 800,000 annotated 3D models, which improved tail category segmentation on the LVIS benchmark and enabled diverse applications like generative 3D models and embodied AI.

Massive data corpora like WebText, Wikipedia, Conceptual Captions, WebImageText, and LAION have propelled recent dramatic progress in AI. Large neural models trained on such datasets produce impressive results and top many of today's benchmarks. A notable omission within this family of large-scale datasets is 3D data. Despite considerable interest and potential applications in 3D vision, datasets of high-fidelity 3D models continue to be mid-sized with limited diversity of object categories. Addressing this gap, we present Objaverse 1.0, a large dataset of objects with 800K+ (and growing) 3D models with descriptive captions, tags, and animations. Objaverse improves upon present day 3D repositories in terms of scale, number of categories, and in the visual diversity of instances within a category. We demonstrate the large potential of Objaverse via four diverse applications: training generative 3D models, improving tail category segmentation on the LVIS benchmark, training open-vocabulary object-navigation models for Embodied AI, and creating a new benchmark for robustness analysis of vision models. Objaverse can open new directions for research and enable new applications across the field of AI.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes