CVJul 15, 2022

3D Instances as 1D Kernels

arXiv:2207.07372v249 citationsh-index: 36Has Code
AI Analysis

This work addresses 3D instance segmentation for point cloud data, offering a novel approach that reduces reliance on proposals or clustering, though it builds on existing dynamic convolution methods.

The paper tackles 3D instance segmentation by representing instances as 1D kernels that encode semantic, positional, and shape information, enabling mask inference through kernel scanning and dynamic convolutions, and it outperforms state-of-the-art methods on ScanNetV2 and S3DIS datasets with improved instance localization.

We introduce a 3D instance representation, termed instance kernels, where instances are represented by one-dimensional vectors that encode the semantic, positional, and shape information of 3D instances. We show that instance kernels enable easy mask inference by simply scanning kernels over the entire scenes, avoiding the heavy reliance on proposals or heuristic clustering algorithms in standard 3D instance segmentation pipelines. The idea of instance kernel is inspired by recent success of dynamic convolutions in 2D/3D instance segmentation. However, we find it non-trivial to represent 3D instances due to the disordered and unstructured nature of point cloud data, e.g., poor instance localization can significantly degrade instance representation. To remedy this, we construct a novel 3D instance encoding paradigm. First, potential instance centroids are localized as candidates. Then, a candidate merging scheme is devised to simultaneously aggregate duplicated candidates and collect context around the merged centroids to form the instance kernels. Once instance kernels are available, instance masks can be reconstructed via dynamic convolutions whose weights are conditioned on instance kernels. The whole pipeline is instantiated with a dynamic kernel network (DKNet). Results show that DKNet outperforms the state of the arts on both ScanNetV2 and S3DIS datasets with better instance localization. Code is available: https://github.com/W1zheng/DKNet.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes