CVMar 17, 2023

Semantic Scene Completion with Cleaner Self

arXiv:2303.09977v122 citationsh-index: 30
Originality Incremental advance
AI Analysis

This work addresses a domain-specific problem in 3D scene understanding for robotics or autonomous systems, with incremental improvements over existing methods.

The paper tackles the problem of incomplete and confused predictions in Semantic Scene Completion (SSC) caused by noisy depth data, by training a cleaner model on perfect surface data and distilling its knowledge into a noisy model, resulting in improvements of 3.1% IoU and 2.2% mIoU and achieving state-of-the-art accuracy on the NYU dataset.

Semantic Scene Completion (SSC) transforms an image of single-view depth and/or RGB 2D pixels into 3D voxels, each of whose semantic labels are predicted. SSC is a well-known ill-posed problem as the prediction model has to "imagine" what is behind the visible surface, which is usually represented by Truncated Signed Distance Function (TSDF). Due to the sensory imperfection of the depth camera, most existing methods based on the noisy TSDF estimated from depth values suffer from 1) incomplete volumetric predictions and 2) confused semantic labels. To this end, we use the ground-truth 3D voxels to generate a perfect visible surface, called TSDF-CAD, and then train a "cleaner" SSC model. As the model is noise-free, it is expected to focus more on the "imagination" of unseen voxels. Then, we propose to distill the intermediate "cleaner" knowledge into another model with noisy TSDF input. In particular, we use the 3D occupancy feature and the semantic relations of the "cleaner self" to supervise the counterparts of the "noisy self" to respectively address the above two incorrect predictions. Experimental results validate that our method improves the noisy counterparts with 3.1% IoU and 2.2% mIoU for measuring scene completion and SSC, and also achieves new state-of-the-art accuracy on the popular NYU dataset.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes