CVRONov 25, 2022

Unsupervised Continual Semantic Adaptation through Neural Rendering

arXiv:2211.13969v221 citationsh-index: 129
Originality Incremental advance
AI Analysis

This addresses the challenge of continual adaptation for perception tasks in robotics or AR/VR, though it is incremental as it builds on existing NeRF and adaptation techniques.

The paper tackles the problem of adapting semantic segmentation models across multiple scenes without ground-truth labels during deployment, achieving improved performance by using Semantic-NeRF networks to generate pseudo-labels and reduce forgetting, as demonstrated by outperforming baselines on ScanNet.

An increasing amount of applications rely on data-driven models that are deployed for perception tasks across a sequence of scenes. Due to the mismatch between training and deployment data, adapting the model on the new scenes is often crucial to obtain good performance. In this work, we study continual multi-scene adaptation for the task of semantic segmentation, assuming that no ground-truth labels are available during deployment and that performance on the previous scenes should be maintained. We propose training a Semantic-NeRF network for each scene by fusing the predictions of a segmentation model and then using the view-consistent rendered semantic labels as pseudo-labels to adapt the model. Through joint training with the segmentation model, the Semantic-NeRF model effectively enables 2D-3D knowledge transfer. Furthermore, due to its compact size, it can be stored in a long-term memory and subsequently used to render data from arbitrary viewpoints to reduce forgetting. We evaluate our approach on ScanNet, where we outperform both a voxel-based baseline and a state-of-the-art unsupervised domain adaptation method.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes