CV LG ROAug 2, 2019

Distilling Knowledge From a Deep Pose Regressor Network

Muhamad Risqi U. Saputra, Pedro P. B. de Gusmao, Yasin Almalioglu, Andrew Markham, Niki Trigoni

arXiv:1908.00858v116.9117 citations

Originality Incremental advance

AI Analysis

This addresses the challenge of efficient Visual Odometry for robotics or autonomous systems, presenting a novel distillation approach for pose regression, though it is incremental in adapting existing distillation techniques to a specific domain.

The paper tackles the problem of distilling knowledge from a deep pose regressor network for efficient Visual Odometry by proposing a method that uses teacher loss as a confidence score to emphasize knowledge transfer only when the teacher is trusted, achieving up to 92.95% parameter reduction and 2.12x faster computation time while keeping student predictions close to the teacher.

This paper presents a novel method to distill knowledge from a deep pose regressor network for efficient Visual Odometry (VO). Standard distillation relies on "dark knowledge" for successful knowledge transfer. As this knowledge is not available in pose regression and the teacher prediction is not always accurate, we propose to emphasize the knowledge transfer only when we trust the teacher. We achieve this by using teacher loss as a confidence score which places variable relative importance on the teacher prediction. We inject this confidence score to the main training task via Attentive Imitation Loss (AIL) and when learning the intermediate representation of the teacher through Attentive Hint Training (AHT) approach. To the best of our knowledge, this is the first work which successfully distill the knowledge from a deep pose regression network. Our evaluation on the KITTI and Malaga dataset shows that we can keep the student prediction close to the teacher with up to 92.95% parameter reduction and 2.12x faster in computation time.

View on arXiv PDF

Similar