CVJan 24, 2024

FoVA-Depth: Field-of-View Agnostic Depth Estimation for Cross-Dataset Generalization

arXiv:2401.13786v18 citations3DV
Originality Incremental advance
AI Analysis

This enables depth estimation for automotive and robotics applications with wide-FoV cameras, addressing a data availability bottleneck.

The paper tackles the problem of training stereo depth estimation models for wide field-of-view cameras using only pinhole camera data, achieving strong generalization across indoor and outdoor datasets where previous methods failed.

Wide field-of-view (FoV) cameras efficiently capture large portions of the scene, which makes them attractive in multiple domains, such as automotive and robotics. For such applications, estimating depth from multiple images is a critical task, and therefore, a large amount of ground truth (GT) data is available. Unfortunately, most of the GT data is for pinhole cameras, making it impossible to properly train depth estimation models for large-FoV cameras. We propose the first method to train a stereo depth estimation model on the widely available pinhole data, and to generalize it to data captured with larger FoVs. Our intuition is simple: We warp the training data to a canonical, large-FoV representation and augment it to allow a single network to reason about diverse types of distortions that otherwise would prevent generalization. We show strong generalization ability of our approach on both indoor and outdoor datasets, which was not possible with previous methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes