CVAIROJun 12, 2024

OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding

arXiv:2406.08009v114 citations
Originality Incremental advance
AI Analysis

This work addresses limitations in 3D scene reconstruction for robotics and vision applications by enabling detailed object-level understanding, though it appears incremental as it builds on existing NeRF and VLM methods.

The paper tackled the problem of open-vocabulary 3D scene reconstruction by introducing OpenObj, an approach for object-level Neural Radiance Fields with fine-grained understanding, achieving superior performance in zero-shot semantic segmentation and retrieval tasks on multiple datasets.

In recent years, there has been a surge of interest in open-vocabulary 3D scene reconstruction facilitated by visual language models (VLMs), which showcase remarkable capabilities in open-set retrieval. However, existing methods face some limitations: they either focus on learning point-wise features, resulting in blurry semantic understanding, or solely tackle object-level reconstruction, thereby overlooking the intricate details of the object's interior. To address these challenges, we introduce OpenObj, an innovative approach to build open-vocabulary object-level Neural Radiance Fields (NeRF) with fine-grained understanding. In essence, OpenObj establishes a robust framework for efficient and watertight scene modeling and comprehension at the object-level. Moreover, we incorporate part-level features into the neural fields, enabling a nuanced representation of object interiors. This approach captures object-level instances while maintaining a fine-grained understanding. The results on multiple datasets demonstrate that OpenObj achieves superior performance in zero-shot semantic segmentation and retrieval tasks. Additionally, OpenObj supports real-world robotics tasks at multiple scales, including global movement and local manipulation.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes