CVApr 11, 2023

Pinpointing Why Object Recognition Performance Degrades Across Income Levels and Geographies

Laura Gustafson, Megan Richards, Melissa Hall, Caner Hazirbas, Diane Bouchacourt, Mark Ibrahim

arXiv:2304.05391v110.48 citationsh-index: 20Has Code

Originality Incremental advance

AI Analysis

This work addresses inequity in AI by identifying specific causes of performance gaps in object recognition across income and geography, offering insights for more equitable vision systems.

The study investigated why object recognition models perform worse for lower-income and geographically diverse populations by annotating images from the Dollar Street dataset with factors like texture and lighting, finding that disparities are most linked to texture, occlusion, and darker lighting, and showing that mitigating texture vulnerabilities can improve performance for lower-income levels.

Despite impressive advances in object-recognition, deep learning systems' performance degrades significantly across geographies and lower income levels raising pressing concerns of inequity. Addressing such performance gaps remains a challenge, as little is understood about why performance degrades across incomes or geographies. We take a step in this direction by annotating images from Dollar Street, a popular benchmark of geographically and economically diverse images, labeling each image with factors such as color, shape, and background. These annotations unlock a new granular view into how objects differ across incomes and regions. We then use these object differences to pinpoint model vulnerabilities across incomes and regions. We study a range of modern vision models, finding that performance disparities are most associated with differences in texture, occlusion, and images with darker lighting. We illustrate how insights from our factor labels can surface mitigations to improve models' performance disparities. As an example, we show that mitigating a model's vulnerability to texture can improve performance on the lower income level. We release all the factor annotations along with an interactive dashboard to facilitate research into more equitable vision systems.

View on arXiv PDF Code

Similar