CVMar 13, 2023

OSIS: Efficient One-stage Network for 3D Instance Segmentation

arXiv:2303.07011v12 citationsh-index: 4
Originality Highly original
AI Analysis

This addresses the problem of slow inference speeds in 3D instance segmentation for applications like robotics and AR/VR, representing an incremental improvement by streamlining the process.

The paper tackles the inefficiency of multi-stage 3D instance segmentation methods by proposing OSIS, a one-stage network that directly segments instances from 3D point clouds, achieving an inference speed of 138ms per scene and excellent performance on indoor datasets.

Current 3D instance segmentation models generally use multi-stage methods to extract instance objects, including clustering, feature extraction, and post-processing processes. However, these multi-stage approaches rely on hyperparameter settings and hand-crafted processes, which restrict the inference speed of the model. In this paper, we propose a new 3D point cloud instance segmentation network, named OSIS. OSIS is a one-stage network, which directly segments instances from 3D point cloud data using neural network. To segment instances directly from the network, we propose an instance decoder, which decodes instance features from the network into instance segments. Our proposed OSIS realizes the end-to-end training by bipartite matching, therefore, our network does not require computationally expensive post-processing steps such as non maximum suppression (NMS) and clustering during inference. The results show that our network finally achieves excellent performance in the commonly used indoor scene instance segmentation dataset, and the inference speed of our network is only an average of 138ms per scene, which substantially exceeds the previous fastest method.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes