CVMar 11, 2019

Structured Knowledge Distillation for Dense Prediction

arXiv:1903.04197v7696 citationsHas Code
Originality Incremental advance
AI Analysis

This work addresses a bottleneck in knowledge distillation for computer vision by introducing structured approaches, offering incremental improvements for researchers and practitioners in dense prediction.

The authors tackled the problem of sub-optimal performance in dense prediction tasks like semantic segmentation by proposing structured knowledge distillation methods that transfer structure information from large to compact networks, achieving improved results across three tasks.

In this work, we consider transferring the structure information from large networks to compact ones for dense prediction tasks in computer vision. Previous knowledge distillation strategies used for dense prediction tasks often directly borrow the distillation scheme for image classification and perform knowledge distillation for each pixel separately, leading to sub-optimal performance. Here we propose to distill structured knowledge from large networks to compact networks, taking into account the fact that dense prediction is a structured prediction problem. Specifically, we study two structured distillation schemes: i) pair-wise distillation that distills the pair-wise similarities by building a static graph; and ii) holistic distillation that uses adversarial training to distill holistic knowledge. The effectiveness of our knowledge distillation approaches is demonstrated by experiments on three dense prediction tasks: semantic segmentation, depth estimation and object detection. Code is available at: https://git.io/StructKD

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes