CVAILGROApr 14, 2023

Self-Supervised Learning based Depth Estimation from Monocular Images

arXiv:2304.06966v12 citationsh-index: 5
Originality Synthesis-oriented
AI Analysis

This work addresses depth estimation for applications like self-driving cars, but it appears incremental as it builds on existing methods without claiming major breakthroughs.

The paper tackles monocular depth estimation by proposing extensions to existing state-of-the-art deep learning models, aiming to improve performance metrics through techniques like pose estimation and semantic segmentation.

Depth Estimation has wide reaching applications in the field of Computer vision such as target tracking, augmented reality, and self-driving cars. The goal of Monocular Depth Estimation is to predict the depth map, given a 2D monocular RGB image as input. The traditional depth estimation methods are based on depth cues and used concepts like epipolar geometry. With the evolution of Convolutional Neural Networks, depth estimation has undergone tremendous strides. In this project, our aim is to explore possible extensions to existing SoTA Deep Learning based Depth Estimation Models and to see whether performance metrics could be further improved. In a broader sense, we are looking at the possibility of implementing Pose Estimation, Efficient Sub-Pixel Convolution Interpolation, Semantic Segmentation Estimation techniques to further enhance our proposed architecture and to provide fine-grained and more globally coherent depth map predictions. We also plan to do away with camera intrinsic parameters during training and apply weather augmentations to further generalize our model.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes