LGNov 17, 2020

Towards All-around Knowledge Transferring: Learning From Task-irrelevant Labels

Yinghui Li, Ruiyang Liu, ZiHao Zhang, Ning Ding, Ying Shen, Linmi Tao, Hai-Tao Zheng

arXiv:2011.08470v21.2

Originality Incremental advance

AI Analysis

This addresses the challenge of data scarcity in classification for researchers and practitioners by leveraging previously neglected task-irrelevant features, though it appears incremental as it builds on existing transfer learning methods.

The paper tackles the problem of negative transfer effects from task-irrelevant features in deep neural models for classification tasks with small datasets, proposing Task-Irrelevant Transfer Learning (TIRTL) to suppress such features and improve learning, with experimental validation on facial expression and digit recognition tasks.

Deep neural models have hitherto achieved significant performances on numerous classification tasks, but meanwhile require sufficient manually annotated data. Since it is extremely time-consuming and expensive to annotate adequate data for each classification task, learning an empirically effective model with generalization on small dataset has received increased attention. Existing efforts mainly focus on transferring task-relevant knowledge from other similar data to tackle the issue. These approaches have yielded remarkable improvements, yet neglecting the fact that the task-irrelevant features could bring out massive negative transfer effects. To date, no large-scale studies have been performed to investigate the impact of task-irrelevant features, let alone the utilization of this kind of features. In this paper, we firstly propose Task-Irrelevant Transfer Learning (TIRTL) to exploit task-irrelevant features, which mainly are extracted from task-irrelevant labels. Particularly, we suppress the expression of task-irrelevant information and facilitate the learning process of classification. We also provide a theoretical explanation of our method. In addition, TIRTL does not conflict with those that have previously exploited task-relevant knowledge and can be well combined to enable the simultaneous utilization of task-relevant and task-irrelevant features for the first time. In order to verify the effectiveness of our theory and method, we conduct extensive experiments on facial expression recognition and digit recognition tasks. Our source code will be also available in the future for reproducibility.

View on arXiv PDF

Similar