CVOct 12, 2018

Real-time self-adaptive deep stereo

arXiv:1810.05424v2290 citations
Originality Incremental advance
AI Analysis

This addresses the domain adaptation issue in stereo vision for applications requiring real-time inference, though it is incremental as it builds on existing deep stereo methods.

The paper tackles the problem of deep stereo networks losing accuracy in new environments by proposing a real-time self-adaptive system that continuously adapts online without retraining, achieving competitive performance on heterogeneous datasets.

Deep convolutional neural networks trained end-to-end are the state-of-the-art methods to regress dense disparity maps from stereo pairs. These models, however, suffer from a notable decrease in accuracy when exposed to scenarios significantly different from the training set, e.g., real vs synthetic images, etc.). We argue that it is extremely unlikely to gather enough samples to achieve effective training/tuning in any target domain, thus making this setup impractical for many applications. Instead, we propose to perform unsupervised and continuous online adaptation of a deep stereo network, which allows for preserving its accuracy in any environment. However, this strategy is extremely computationally demanding and thus prevents real-time inference. We address this issue introducing a new lightweight, yet effective, deep stereo architecture, Modularly ADaptive Network (MADNet) and developing a Modular ADaptation (MAD) algorithm, which independently trains sub-portions of the network. By deploying MADNet together with MAD we introduce the first real-time self-adaptive deep stereo system enabling competitive performance on heterogeneous datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes