CVAIMay 20, 2024

Distill-then-prune: An Efficient Compression Framework for Real-time Stereo Matching Network on Edge Devices

arXiv:2405.11809v18 citationsh-index: 4ICRA
Originality Incremental advance
AI Analysis

This work addresses the problem of efficient stereo matching for edge devices, offering an incremental improvement by integrating existing compression techniques.

The paper tackled the trade-off between speed and accuracy in real-time stereo matching networks by proposing a framework that combines knowledge distillation and model pruning, resulting in a model that maintains real-time performance with high accuracy on edge devices, as validated on Sceneflow and KITTI benchmarks.

In recent years, numerous real-time stereo matching methods have been introduced, but they often lack accuracy. These methods attempt to improve accuracy by introducing new modules or integrating traditional methods. However, the improvements are only modest. In this paper, we propose a novel strategy by incorporating knowledge distillation and model pruning to overcome the inherent trade-off between speed and accuracy. As a result, we obtained a model that maintains real-time performance while delivering high accuracy on edge devices. Our proposed method involves three key steps. Firstly, we review state-of-the-art methods and design our lightweight model by removing redundant modules from those efficient models through a comparison of their contributions. Next, we leverage the efficient model as the teacher to distill knowledge into the lightweight model. Finally, we systematically prune the lightweight model to obtain the final model. Through extensive experiments conducted on two widely-used benchmarks, Sceneflow and KITTI, we perform ablation studies to analyze the effectiveness of each module and present our state-of-the-art results.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes