IV CV OPTICS
Oct 19, 2022

Video super-resolution for single-photon LIDAR

Germán Mora Martín, Stirling Scholes, Alice Ruget, Robert K. Henderson, Jonathan Leach, Istvan Gyongy

arXiv:2210.10474v18.111 citations
h-index: 30

Originality: Synthesis-oriented

AI Analysis

The work addresses scene-interpretation difficulties in applications such as self-driving cars and robotics, but it is incremental: it applies existing deep learning methods to a specific sensor-data challenge.

The paper tackles the problem of low lateral resolution and low signal-to-noise ratio in single-photon LIDAR depth maps by training a 3D CNN for denoising and upscaling (x4) depth data, achieving processing at >30 frames per second with GPU acceleration.

3D Time-of-Flight (ToF) image sensors are used widely in applications such as self-driving cars, Augmented Reality (AR) and robotics. When implemented with Single-Photon Avalanche Diodes (SPADs), compact, array format sensors can be made that offer accurate depth maps over long distances, without the need for mechanical scanning. However, array sizes tend to be small, leading to low lateral resolution, which combined with low Signal-to-Noise Ratio (SNR) levels under high ambient illumination, may lead to difficulties in scene interpretation. In this paper, we use synthetic depth sequences to train a 3D Convolutional Neural Network (CNN) for denoising and upscaling (x4) depth data. Experimental results, based on synthetic as well as real ToF data, are used to demonstrate the effectiveness of the scheme. With GPU acceleration, frames are processed at >30 frames per second, making the approach suitable for low-latency imaging, as required for obstacle avoidance.
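
For intuition about what "denoising and upscaling (x4)" means in terms of input and output shapes, here is a minimal sketch of a naive baseline. It is not the paper's 3D CNN: it swaps in a per-pixel temporal median over recent frames for denoising and nearest-neighbour pixel repetition for upscaling. The function name `naive_video_upscale` and the `window` parameter are hypothetical, chosen only for illustration.

```python
from statistics import median


def naive_video_upscale(frames, scale=4, window=3):
    """Naive baseline (NOT the paper's 3D CNN).

    frames: list of T depth maps, each a list of H rows of W depth values.
    Returns T depth maps of size (H * scale) x (W * scale).
    """
    out = []
    for i in range(len(frames)):
        # Causal temporal window: the current frame plus up to
        # window - 1 previous frames, so no future frames are needed.
        recent = frames[max(0, i - window + 1): i + 1]
        height, width = len(frames[i]), len(frames[i][0])

        # Per-pixel temporal median suppresses isolated outliers.
        denoised = [
            [median(f[r][c] for f in recent) for c in range(width)]
            for r in range(height)
        ]

        # Nearest-neighbour upscaling: repeat each pixel `scale` times
        # horizontally, then repeat each widened row `scale` times.
        upscaled = []
        for row in denoised:
            wide = [v for v in row for _ in range(scale)]
            upscaled.extend(list(wide) for _ in range(scale))
        out.append(upscaled)
    return out
```

Calling this on a sequence of 2 x 4 depth maps returns 8 x 16 maps, with a single-frame outlier removed by the median. A learned 3D CNN replaces both hand-written steps with spatio-temporal convolutions, which is what allows it to recover detail that pixel repetition cannot.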
