CVSep 8, 2022

Measuring Human Perception to Improve Open Set Recognition

arXiv:2209.03519v411 citationsh-index: 41
Originality Highly original
AI Analysis

This work addresses the challenge of recognizing novel objects in vision tasks for AI systems, offering a novel approach that leverages human behavioral data to enhance performance in data-limited scenarios.

The paper tackled the problem of open set recognition by incorporating human perception data, specifically reaction times, to improve algorithm performance, achieving significant improvements such as a 33.18% increase in top-1 test accuracy on unknown samples.

The human ability to recognize when an object belongs or does not belong to a particular vision task outperforms all open set recognition algorithms. Human perception as measured by the methods and procedures of visual psychophysics from psychology provides an additional data stream for algorithms that need to manage novelty. For instance, measured reaction time from human subjects can offer insight as to whether a class sample is prone to be confused with a different class -- known or novel. In this work, we designed and performed a large-scale behavioral experiment that collected over 200,000 human reaction time measurements associated with object recognition. The data collected indicated reaction time varies meaningfully across objects at the sample-level. We therefore designed a new psychophysical loss function that enforces consistency with human behavior in deep networks which exhibit variable reaction time for different images. As in biological vision, this approach allows us to achieve good open set recognition performance in regimes with limited labeled training data. Through experiments using data from ImageNet, significant improvement is observed when training Multi-Scale DenseNets with this new formulation: it significantly improved top-1 validation accuracy by 6.02%, top-1 test accuracy on known samples by 9.81%, and top-1 test accuracy on unknown samples by 33.18%. We compared our method to 10 open set recognition methods from the literature, which were all outperformed on multiple metrics.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes