CV LG Mar 30, 2023

Neglected Free Lunch -- Learning Image Classifiers Using Annotation Byproducts

Dongyoon Han, Junsuk Choe, Seonghyeok Chun, John Joon Young Chung, Minsuk Chang, Sangdoo Yun, Jean Y. Song, Seong Joon Oh

arXiv:2303.17595v35.94 citations h-index: 38 Has Code

Originality Highly original

AI Analysis

The paper addresses spurious correlations in image classification by using byproducts of the annotation process as an auxiliary training signal.

The paper argues that standard supervised learning neglects auxiliary information produced during annotation, such as mouse traces and clicks. It shows that learning using these annotation byproducts (LUAB) improves model generalisability and robustness at no extra annotation cost, and it introduces two datasets, ImageNet-AB and COCO-AB, to validate the approach.

Supervised learning of image classifiers distills human knowledge into a parametric model through pairs of images and corresponding labels (X,Y). We argue that this simple and widely used representation of human knowledge neglects rich auxiliary information from the annotation procedure, such as the time-series of mouse traces and clicks left after image selection. Our insight is that such annotation byproducts Z provide approximate human attention that weakly guides the model to focus on the foreground cues, reducing spurious correlations and discouraging shortcut learning. To verify this, we create ImageNet-AB and COCO-AB. They are ImageNet and COCO training sets enriched with sample-wise annotation byproducts, collected by replicating the respective original annotation tasks. We refer to the new paradigm of training models with annotation byproducts as learning using annotation byproducts (LUAB). We show that a simple multitask loss for regressing Z together with Y already improves the generalisability and robustness of the learned models. Compared to the original supervised learning, LUAB does not require extra annotation costs. ImageNet-AB and COCO-AB are at https://github.com/naver-ai/NeglectedFreeLunch.
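The multitask loss mentioned in the abstract can be illustrated with a minimal sketch: a classification term on Y plus a weighted regression term on Z. The abstract does not specify the regression loss, the weighting, or the encoding of Z, so the L1 loss, the weight `lam`, the function names, and the choice of Z as a click location normalised to [0, 1] are all assumptions made here for illustration, not the authors' implementation.

```python
import math


def softmax_cross_entropy(logits, label):
    """Cross-entropy for one sample, given raw class logits and an integer label."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum_exp = m + math.log(sum(math.exp(v - m) for v in logits))
    return log_sum_exp - logits[label]


def l1_regression(pred, target):
    """Mean absolute error between predicted and recorded byproduct values."""
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(target)


def luab_multitask_loss(logits, label, z_pred, z_target, lam=1.0):
    """Classification loss on Y plus a weighted regression loss on Z.

    Both `lam` and the use of L1 are illustrative assumptions.
    """
    ce = softmax_cross_entropy(logits, label)
    return ce + lam * l1_regression(z_pred, z_target)


# Hypothetical sample: 3 classes; Z is a click location normalised to [0, 1]^2.
logits = [2.0, 0.5, -1.0]
label = 0
z_pred = [0.40, 0.55]    # model's predicted click location
z_target = [0.50, 0.50]  # click recorded during annotation
loss = luab_multitask_loss(logits, label, z_pred, z_target, lam=0.5)
```

Setting `lam=0.0` recovers plain supervised learning on (X, Y), which makes the auxiliary term easy to ablate in this sketch.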
