CVROMay 27, 2021

3D Segmentation Learning from Sparse Annotations and Hierarchical Descriptors

arXiv:2105.12885v2
Originality Incremental advance
AI Analysis

This addresses the high annotation cost for 3D segmentation, offering a more efficient solution for researchers and practitioners in computer vision, though it appears incremental as it builds on existing sparse annotation methods.

The paper tackles the problem of expensive point-wise annotations for 3D semantic segmentation by proposing GIDSeg, which learns from sparse annotations using hierarchical descriptors and adversarial learning, achieving superior performance with only 5% annotations compared to state-of-the-art methods.

One of the main obstacles to 3D semantic segmentation is the significant amount of endeavor required to generate expensive point-wise annotations for fully supervised training. To alleviate manual efforts, we propose GIDSeg, a novel approach that can simultaneously learn segmentation from sparse annotations via reasoning global-regional structures and individual-vicinal properties. GIDSeg depicts global- and individual- relation via a dynamic edge convolution network coupled with a kernelized identity descriptor. The ensemble effects are obtained by endowing a fine-grained receptive field to a low-resolution voxelized map. In our GIDSeg, an adversarial learning module is also designed to further enhance the conditional constraint of identity descriptors within the joint feature distribution. Despite the apparent simplicity, our proposed approach achieves superior performance over state-of-the-art for inferencing 3D dense segmentation with only sparse annotations. Particularly, with $5\%$ annotations of raw data, GIDSeg outperforms other 3D segmentation methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes