CV AI LG ROApr 14, 2023

Self-Supervised Learning based Depth Estimation from Monocular Images

Mayank Poddar, Akash Mishra, Mohit Kewlani, Haoyang Pei

arXiv:2304.06966v11.52 citationsh-index: 5Has Code

Originality Synthesis-oriented

AI Analysis

This work addresses depth estimation for applications like self-driving cars, but it appears incremental as it builds on existing methods without claiming major breakthroughs.

The paper tackles monocular depth estimation by proposing extensions to existing state-of-the-art deep learning models, aiming to improve performance metrics through techniques like pose estimation and semantic segmentation.

Depth Estimation has wide reaching applications in the field of Computer vision such as target tracking, augmented reality, and self-driving cars. The goal of Monocular Depth Estimation is to predict the depth map, given a 2D monocular RGB image as input. The traditional depth estimation methods are based on depth cues and used concepts like epipolar geometry. With the evolution of Convolutional Neural Networks, depth estimation has undergone tremendous strides. In this project, our aim is to explore possible extensions to existing SoTA Deep Learning based Depth Estimation Models and to see whether performance metrics could be further improved. In a broader sense, we are looking at the possibility of implementing Pose Estimation, Efficient Sub-Pixel Convolution Interpolation, Semantic Segmentation Estimation techniques to further enhance our proposed architecture and to provide fine-grained and more globally coherent depth map predictions. We also plan to do away with camera intrinsic parameters during training and apply weather augmentations to further generalize our model.

View on arXiv PDF Code

Similar