CVNov 7, 2024

VAIR: Visuo-Acoustic Implicit Representations for Low-Cost, Multi-Modal Transparent Surface Reconstruction in Indoor Scenes

arXiv:2411.04963v11 citationsh-index: 7ICRA
Originality Incremental advance
AI Analysis

This addresses navigation challenges for mobile robots in indoor environments with transparent surfaces, representing an incremental advance in multi-modal sensing.

The paper tackles transparent surface reconstruction in indoor scenes by fusing acoustic and visual sensing through implicit neural representations, achieving significant improvement over state-of-the-art methods on a new low-cost dataset.

Mobile robots operating indoors must be prepared to navigate challenging scenes that contain transparent surfaces. This paper proposes a novel method for the fusion of acoustic and visual sensing modalities through implicit neural representations to enable dense reconstruction of transparent surfaces in indoor scenes. We propose a novel model that leverages generative latent optimization to learn an implicit representation of indoor scenes consisting of transparent surfaces. We demonstrate that we can query the implicit representation to enable volumetric rendering in image space or 3D geometry reconstruction (point clouds or mesh) with transparent surface prediction. We evaluate our method's effectiveness qualitatively and quantitatively on a new dataset collected using a custom, low-cost sensing platform featuring RGB-D cameras and ultrasonic sensors. Our method exhibits significant improvement over state-of-the-art for transparent surface reconstruction.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes