LG, CV | Jul 8, 2022

Towards a More Rigorous Science of Blindspot Discovery in Image Classification Models

Gregory Plumb, Nari Johnson, Ángel Alexander Cabrera, Ameet Talwalkar

CMU

arXiv:2207.04104v3 | 7.89 citations | h-index: 51 | Has Code

Originality: Incremental advance

AI Analysis

This work addresses the need for more rigorous evaluation of blindspot discovery in image classification. The contribution is incremental but important for improving model reliability.

The paper tackles the problem of evaluating blindspot discovery methods (BDMs) in image classifiers by introducing a new framework, SpotCheck, and a new BDM, PlaneSpot, showing that PlaneSpot is competitive with or outperforms existing methods in controlled and real-data experiments.

A growing body of work studies Blindspot Discovery Methods ("BDM"s): methods that use an image embedding to find semantically meaningful (i.e., united by a human-understandable concept) subsets of the data where an image classifier performs significantly worse. Motivated by observed gaps in prior work, we introduce a new framework for evaluating BDMs, SpotCheck, that uses synthetic image datasets to train models with known blindspots and a new BDM, PlaneSpot, that uses a 2D image representation. We use SpotCheck to run controlled experiments that identify factors that influence BDM performance (e.g., the number of blindspots in a model, or features used to define the blindspot) and show that PlaneSpot is competitive with and in many cases outperforms existing BDMs. Importantly, we validate these findings by designing additional experiments that use real image data from MS-COCO, a large image benchmark dataset. Our findings suggest several promising directions for future work on BDM design and evaluation. Overall, we hope that the methodology and analyses presented in this work will help facilitate a more rigorous science of blindspot discovery.
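The abstract's definition of a BDM (use an image embedding to find subsets of the data where the classifier performs significantly worse) can be illustrated with a minimal sketch. This is not PlaneSpot and not the SpotCheck evaluation protocol: it assumes a plain k-means clustering of the embedding and a simple error-rate gap, and the names `kmeans`, `find_candidate_blindspots`, `precision_recall`, and the parameters `k` and `min_gap` are all hypothetical, chosen only for illustration.

```python
import numpy as np


def kmeans(points, k, n_iter=50, seed=0):
    """Plain Lloyd's k-means; returns one integer cluster label per point."""
    rng = np.random.default_rng(seed)
    # Fancy indexing returns a copy, so updating centers leaves points intact.
    centers = points[rng.choice(len(points), size=k, replace=False)]
    labels = np.zeros(len(points), dtype=int)
    for _ in range(n_iter):
        dists = np.linalg.norm(points[:, None, :] - centers[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = points[labels == j]
            if len(members) > 0:
                centers[j] = members.mean(axis=0)
    return labels


def find_candidate_blindspots(embedding, is_error, k=2, min_gap=0.2, seed=0):
    """Flag clusters whose error rate exceeds the overall rate by min_gap.

    embedding: (n, d) float array, one row per image (assumed precomputed).
    is_error:  (n,) bool array, True where the classifier was wrong.
    Returns a list of (member_indices, error_rate), worst cluster first.
    """
    labels = kmeans(embedding, k, seed=seed)
    overall = is_error.mean()
    candidates = []
    for j in range(k):
        mask = labels == j
        if mask.sum() == 0:
            continue
        rate = float(is_error[mask].mean())
        if rate - overall >= min_gap:
            candidates.append((np.flatnonzero(mask), rate))
    candidates.sort(key=lambda c: -c[1])
    return candidates


def precision_recall(found_idx, true_idx):
    """Overlap between a discovered subset and a known blindspot.

    An illustrative score only; the paper's own metrics may differ.
    """
    found, true = set(map(int, found_idx)), set(map(int, true_idx))
    hit = len(found & true)
    precision = hit / len(found) if found else 0.0
    recall = hit / len(true) if true else 0.0
    return precision, recall


# Toy data: two well-separated groups in a 2D embedding; the second group
# is a planted "blindspot" with a much higher error rate.
rng = np.random.default_rng(1)
embedding = np.vstack([rng.normal(0.0, 0.5, (100, 2)),
                       rng.normal(10.0, 0.5, (100, 2))])
is_error = np.zeros(200, dtype=bool)
is_error[:5] = True        # 5% errors in the first group
is_error[100:160] = True   # 60% errors in the planted blindspot

for idx, rate in find_candidate_blindspots(embedding, is_error):
    print(len(idx), rate, precision_recall(idx, np.arange(100, 200)))
```

Because the blindspot is planted, the discovered subset can be scored against ground truth, which is the kind of controlled comparison that training models with known blindspots makes possible. Real embeddings are high-dimensional and real blindspots are rarely this cleanly separated, so the clustering step and the threshold would need far more care in practice.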
