CVLGIVJun 18, 2022

Analysis & Computational Complexity Reduction of Monocular and Stereo Depth Estimation Techniques

arXiv:2206.09071v1h-index: 3
Originality Synthesis-oriented
AI Analysis

This work addresses the need for efficient, real-time depth estimation in battery-operated autonomous systems, though it is incremental as it builds on existing methods like AnyNet.

The paper tackled the problem of reducing computational complexity in monocular and stereo depth estimation for autonomous systems, showing that model size reductions of ~75% for monocular and ~20% for stereo methods resulted in accuracy losses of less than 2% and 3%, respectively.

Accurate depth estimation with lowest compute and energy cost is a crucial requirement for unmanned and battery operated autonomous systems. Robotic applications require real time depth estimation for navigation and decision making under rapidly changing 3D surroundings. A high accuracy algorithm may provide the best depth estimation but may consume tremendous compute and energy resources. A general trade-off is to choose less accurate methods for initial depth estimate and a more accurate yet compute intensive method when needed. Previous work has shown this trade-off can be improved by developing a state-of-the-art method (AnyNet) to improve stereo depth estimation. We studied both the monocular and stereo vision depth estimation methods and investigated methods to reduce computational complexity of these methods. This was our baseline. Consequently, our experiments show reduction of monocular depth estimation model size by ~75% reduces accuracy by less than 2% (SSIM metric). Our experiments with the novel stereo vision method (AnyNet) show that accuracy of depth estimation does not degrade more than 3% (three pixel error metric) in spite of reduction in model size by ~20%. We have shown that smaller models can indeed perform competitively.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes