CVAug 22, 2022

Minimizing the Effect of Noise and Limited Dataset Size in Image Classification Using Depth Estimation as an Auxiliary Task with Deep Multitask Learning

U of Toronto
arXiv:2208.10390v1h-index: 31Has Code
Originality Synthesis-oriented
AI Analysis

This work addresses generalizability issues in image classification for ML practitioners, but it is incremental as it applies existing dMTL methods to new datasets.

The paper tackled the problem of improving generalizability in image classification under noise and limited data by using depth estimation as an auxiliary task in deep multitask learning. It showed that dMTL improves classifier performance on noisy and small datasets, with experiments on MNIST and NYU-Depth-V2.

Generalizability is the ultimate goal of Machine Learning (ML) image classifiers, for which noise and limited dataset size are among the major concerns. We tackle these challenges through utilizing the framework of deep Multitask Learning (dMTL) and incorporating image depth estimation as an auxiliary task. On a customized and depth-augmented derivation of the MNIST dataset, we show a) multitask loss functions are the most effective approach of implementing dMTL, b) limited dataset size primarily contributes to classification inaccuracy, and c) depth estimation is mostly impacted by noise. In order to further validate the results, we manually labeled the NYU Depth V2 dataset for scene classification tasks. As a contribution to the field, we have made the data in python native format publicly available as an open-source dataset and provided the scene labels. Our experiments on MNIST and NYU-Depth-V2 show dMTL improves generalizability of the classifiers when the dataset is noisy and the number of examples is limited.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes