CVJan 8, 2019

3D Object Detection Using Scale Invariant and Feature Reweighting Networks

arXiv:1901.02237v1107 citations
Originality Incremental advance
AI Analysis

This work addresses 3D object detection for applications like autonomous driving and robotics, but it is incremental as it builds on existing methods with hybrid improvements.

The paper tackles 3D object detection by proposing a network that uses front view images and frustum point clouds, incorporating PointSIFT for 3D segmentation and SENet for feature reweighting, achieving better performance than state-of-the-art methods, particularly with sparse point clouds.

3D object detection plays an important role in a large number of real-world applications. It requires us to estimate the localizations and the orientations of 3D objects in real scenes. In this paper, we present a new network architecture which focuses on utilizing the front view images and frustum point clouds to generate 3D detection results. On the one hand, a PointSIFT module is utilized to improve the performance of 3D segmentation. It can capture the information from different orientations in space and the robustness to different scale shapes. On the other hand, our network obtains the useful features and suppresses the features with less information by a SENet module. This module reweights channel features and estimates the 3D bounding boxes more effectively. Our method is evaluated on both KITTI dataset for outdoor scenes and SUN-RGBD dataset for indoor scenes. The experimental results illustrate that our method achieves better performance than the state-of-the-art methods especially when point clouds are highly sparse.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes