CVOct 25, 2021

MUSE: Feature Self-Distillation with Mutual Information and Self-Information

arXiv:2110.12606v1
Originality Incremental advance
AI Analysis

This work addresses the challenge of enhancing feature dependency in CNNs for researchers and practitioners in computer vision, offering a novel method that is versatile for tasks like image classification and object detection, though it builds incrementally on existing knowledge distillation frameworks.

The paper tackles the problem of improving feature expressivity in deep convolutional neural networks by introducing dependency among features using an information-theoretic approach called MUSE, which combines mutual and self-information, achieving superior performance in self-distillation and online distillation across various architectures and tasks.

We present a novel information-theoretic approach to introduce dependency among features of a deep convolutional neural network (CNN). The core idea of our proposed method, called MUSE, is to combine MUtual information and SElf-information to jointly improve the expressivity of all features extracted from different layers in a CNN. We present two variants of the realization of MUSE -- Additive Information and Multiplicative Information. Importantly, we argue and empirically demonstrate that MUSE, compared to other feature discrepancy functions, is a more functional proxy to introduce dependency and effectively improve the expressivity of all features in the knowledge distillation framework. MUSE achieves superior performance over a variety of popular architectures and feature discrepancy functions for self-distillation and online distillation, and performs competitively with the state-of-the-art methods for offline distillation. MUSE is also demonstrably versatile that enables it to be easily extended to CNN-based models on tasks other than image classification such as object detection.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes