IVCVNov 16, 2022

Consistent Direct Time-of-Flight Video Depth Super-Resolution

arXiv:2211.08658v224 citationsh-index: 14Has Code
AI Analysis

This work addresses a key bottleneck for on-device 3D sensing in mobile applications, offering a novel multi-frame approach to enhance depth resolution, though it is incremental relative to existing RGB-guided methods.

The paper tackles the problem of super-resolving low-resolution direct time-of-flight (dToF) depth data for mobile devices by fusing it with high-resolution RGB video, achieving improved depth accuracy in dynamic indoor environments, with results demonstrated on a new synthetic dataset.

Direct time-of-flight (dToF) sensors are promising for next-generation on-device 3D sensing. However, limited by manufacturing capabilities in a compact module, the dToF data has a low spatial resolution (e.g., $\sim 20\times30$ for iPhone dToF), and it requires a super-resolution step before being passed to downstream tasks. In this paper, we solve this super-resolution problem by fusing the low-resolution dToF data with the corresponding high-resolution RGB guidance. Unlike the conventional RGB-guided depth enhancement approaches, which perform the fusion in a per-frame manner, we propose the first multi-frame fusion scheme to mitigate the spatial ambiguity resulting from the low-resolution dToF imaging. In addition, dToF sensors provide unique depth histogram information for each local patch, and we incorporate this dToF-specific feature in our network design to further alleviate spatial ambiguity. To evaluate our models on complex dynamic indoor environments and to provide a large-scale dToF sensor dataset, we introduce DyDToF, the first synthetic RGB-dToF video dataset that features dynamic objects and a realistic dToF simulator following the physical imaging process. We believe the methods and dataset are beneficial to a broad community as dToF depth sensing is becoming mainstream on mobile devices. Our code and data are publicly available: https://github.com/facebookresearch/DVSR/

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes