CVSep 9, 2017

Optimal Transport for Deep Joint Transfer Learning

arXiv:1709.02995v19 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of efficient knowledge transfer in deep learning for scenarios with small labeled datasets, offering a domain-specific improvement.

The authors tackled the problem of fine-tuning deep neural networks for classification with limited target data by proposing a joint transfer learning method that incorporates an optimal transport loss between source and target predictions, achieving improved performance over standard fine-tuning on image classification datasets.

Training a Deep Neural Network (DNN) from scratch requires a large amount of labeled data. For a classification task where only small amount of training data is available, a common solution is to perform fine-tuning on a DNN which is pre-trained with related source data. This consecutive training process is time consuming and does not consider explicitly the relatedness between different source and target tasks. In this paper, we propose a novel method to jointly fine-tune a Deep Neural Network with source data and target data. By adding an Optimal Transport loss (OT loss) between source and target classifier predictions as a constraint on the source classifier, the proposed Joint Transfer Learning Network (JTLN) can effectively learn useful knowledge for target classification from source data. Furthermore, by using different kind of metric as cost matrix for the OT loss, JTLN can incorporate different prior knowledge about the relatedness between target categories and source categories. We carried out experiments with JTLN based on Alexnet on image classification datasets and the results verify the effectiveness of the proposed JTLN in comparison with standard consecutive fine-tuning. This Joint Transfer Learning with OT loss is general and can also be applied to other kind of Neural Networks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes