LGMar 14, 2023

A Contrastive Knowledge Transfer Framework for Model Compression and Transfer Learning

arXiv:2303.07599v12 citationsh-index: 11
Originality Incremental advance
AI Analysis

This work addresses the problem of inefficient knowledge transfer in deep learning for researchers and practitioners, offering a generalized framework that enhances existing techniques, though it is incremental as it builds on prior knowledge transfer methods.

The paper tackles the problem of limited effectiveness in knowledge transfer for model compression and transfer learning by proposing a Contrastive Knowledge Transfer Framework (CKTF) that transfers structural knowledge from teacher to student models, resulting in performance improvements of 0.04% to 11.59% in model compression and 0.4% to 4.75% in transfer learning.

Knowledge Transfer (KT) achieves competitive performance and is widely used for image classification tasks in model compression and transfer learning. Existing KT works transfer the information from a large model ("teacher") to train a small model ("student") by minimizing the difference of their conditionally independent output distributions. However, these works overlook the high-dimension structural knowledge from the intermediate representations of the teacher, which leads to limited effectiveness, and they are motivated by various heuristic intuitions, which makes it difficult to generalize. This paper proposes a novel Contrastive Knowledge Transfer Framework (CKTF), which enables the transfer of sufficient structural knowledge from the teacher to the student by optimizing multiple contrastive objectives across the intermediate representations between them. Also, CKTF provides a generalized agreement to existing KT techniques and increases their performance significantly by deriving them as specific cases of CKTF. The extensive evaluation shows that CKTF consistently outperforms the existing KT works by 0.04% to 11.59% in model compression and by 0.4% to 4.75% in transfer learning on various models and datasets.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes