CVAIJun 30, 2023

GMM: Delving into Gradient Aware and Model Perceive Depth Mining for Monocular 3D Detection

arXiv:2306.17450v11 citationsh-index: 33
Originality Incremental advance
AI Analysis

This work addresses depth accuracy in 3D detection for autonomous driving, offering an incremental improvement through a novel mining strategy.

The paper tackled the problem of depth perception in monocular 3D object detection by proposing a sample mining strategy called GMM, which improved performance on the nuScenes dataset to 42.1% mAP and 47.3% NDS, outperforming other state-of-the-art techniques.

Depth perception is a crucial component of monoc-ular 3D detection tasks that typically involve ill-posed problems. In light of the success of sample mining techniques in 2D object detection, we propose a simple yet effective mining strategy for improving depth perception in 3D object detection. Concretely, we introduce a plain metric to evaluate the quality of depth predictions, which chooses the mined sample for the model. Moreover, we propose a Gradient-aware and Model-perceive Mining strategy (GMM) for depth learning, which exploits the predicted depth quality for better depth learning through easy mining. GMM is a general strategy that can be readily applied to several state-of-the-art monocular 3D detectors, improving the accuracy of depth prediction. Extensive experiments on the nuScenes dataset demonstrate that the proposed methods significantly improve the performance of 3D object detection while outperforming other state-of-the-art sample mining techniques by a considerable margin. On the nuScenes benchmark, GMM achieved the state-of-the-art (42.1% mAP and 47.3% NDS) performance in monocular object detection.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes