CVAug 23, 2021

Separable Convolutions for Optimizing 3D Stereo Networks

arXiv:2108.10216v19 citations
Originality Incremental advance
AI Analysis

This addresses a bottleneck for real-world applications of 3D stereo networks, offering a practical optimization with incremental improvements.

The paper tackles the high computational complexity of 3D stereo networks by showing that 3D convolutions consume up to 94% of operations, and proposes separable convolutions that reduce operations by up to 7x and parameters by up to 3.5x without compromising performance.

Deep learning based 3D stereo networks give superior performance compared to 2D networks and conventional stereo methods. However, this improvement in the performance comes at the cost of increased computational complexity, thus making these networks non-practical for the real-world applications. Specifically, these networks use 3D convolutions as a major work horse to refine and regress disparities. In this work first, we show that these 3D convolutions in stereo networks consume up to 94% of overall network operations and act as a major bottleneck. Next, we propose a set of "plug-&-run" separable convolutions to reduce the number of parameters and operations. When integrated with the existing state of the art stereo networks, these convolutions lead up to 7x reduction in number of operations and up to 3.5x reduction in parameters without compromising their performance. In fact these convolutions lead to improvement in their performance in the majority of cases.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes