CV | Jul 7, 2024

SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning

arXiv:2407.05283v1 | 15 citations | h-index: 11 | Has Code
Originality: Incremental advance
AI Analysis

This paper addresses a key bottleneck in autonomous driving by improving camera pose estimation in dynamic scenarios, though it is an incremental advance over existing methods.

The paper tackles inaccurate camera pose estimation in unsupervised monocular depth learning for autonomous driving, and reports a 22.2% reduction in average translation error and a 34.8% reduction in average angular error on the KITTI Odometry dataset.

Unsupervised monocular depth estimation frameworks have shown promising performance in autonomous driving. However, existing solutions primarily rely on a simple convolutional neural network for ego-motion recovery, which struggles to estimate precise camera poses in dynamic, complicated real-world scenarios. These inaccurately estimated camera poses inevitably deteriorate the photometric reconstruction and mislead the depth estimation networks with wrong supervisory signals. In this article, we introduce SCIPaD, a novel approach that incorporates spatial clues for unsupervised depth-pose joint learning. Specifically, a confidence-aware feature flow estimator is proposed to acquire 2D feature positional translations and their associated confidence levels. Meanwhile, we introduce a positional clue aggregator, which integrates pseudo 3D point clouds from DepthNet and 2D feature flows into homogeneous positional representations. Finally, a hierarchical positional embedding injector is proposed to selectively inject spatial clues into semantic features for robust camera pose decoding. Extensive experiments and analyses demonstrate the superior performance of our model compared to other state-of-the-art methods. Remarkably, SCIPaD achieves a reduction of 22.2% in average translation error and 34.8% in average angular error for the camera pose estimation task on the KITTI Odometry dataset. Our source code is available at https://mias.group/SCIPaD.
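The abstract mentions "pseudo 3D point clouds from DepthNet" without saying how they are formed. One common way to lift a predicted depth map into 3D is pinhole back-projection, sketched below. This is a generic illustration, not SCIPaD's implementation: the function name `backproject_depth`, the toy depth values, and the intrinsics `fx`, `fy`, `cx`, `cy` are all assumptions made for the example.

```python
from typing import List, Sequence, Tuple

Point3D = Tuple[float, float, float]


def backproject_depth(
    depth: Sequence[Sequence[float]],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
) -> List[List[Point3D]]:
    """Lift every pixel (u, v) with depth d into camera coordinates.

    Uses the standard pinhole model (hypothetical helper, not from the paper):
        X = (u - cx) * d / fx,  Y = (v - cy) * d / fy,  Z = d
    """
    cloud: List[List[Point3D]] = []
    for v, row in enumerate(depth):
        cloud.append(
            [((u - cx) * d / fx, (v - cy) * d / fy, d) for u, d in enumerate(row)]
        )
    return cloud


# Toy 2x3 depth map in metres and made-up intrinsics (assumed values).
depth = [
    [2.0, 2.0, 2.0],
    [4.0, 4.0, 4.0],
]
cloud = backproject_depth(depth, fx=2.0, fy=2.0, cx=1.0, cy=0.5)
print(cloud[0][0])  # 3D point for the top-left pixel
print(cloud[1][2])  # 3D point for the bottom-right pixel
```

A real pipeline would typically do this with batched tensors on the GPU; the nested-list version only makes the per-pixel geometry explicit.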

Code Implementations: 1 repo