CVNov 5, 2020

Towards Keypoint Guided Self-Supervised Depth Estimation

arXiv:2011.03091v1
AI Analysis

This is an incremental improvement for computer vision tasks like autonomous driving or robotics where accurate depth estimation is needed.

The paper tackles depth estimation from images without ground truth by proposing a self-supervised method that uses keypoints for reprojection instead of individual pixels, showing that this approach improves depth estimation learning.

This paper proposes to use keypoints as a self-supervision clue for learning depth map estimation from a collection of input images. As ground truth depth from real images is difficult to obtain, there are many unsupervised and self-supervised approaches to depth estimation that have been proposed. Most of these unsupervised approaches use depth map and ego-motion estimations to reproject the pixels from the current image into the adjacent image from the image collection. Depth and ego-motion estimations are evaluated based on pixel intensity differences between the correspondent original and reprojected pixels. Instead of reprojecting the individual pixels, we propose to first select image keypoints in both images and then reproject and compare the correspondent keypoints of the two images. The keypoints should describe the distinctive image features well. By learning a deep model with and without the keypoint extraction technique, we show that using the keypoints improve the depth estimation learning. We also propose some future directions for keypoint-guided learning of structure-from-motion problems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes