AINCNov 6, 2024

RTify: Aligning Deep Neural Networks with Human Behavioral Decisions

arXiv:2411.03630v210 citationsh-index: 19NIPS
Originality Incremental advance
AI Analysis

This work addresses the gap in modeling perceptual decision dynamics for vision science, though it appears incremental by building on existing models.

The authors tackled the problem of aligning neural network models with human behavioral decisions by developing a framework to match recurrent neural network dynamics to human reaction times, showing that it accounts well for human RT data in psychophysics experiments.

Current neural network models of primate vision focus on replicating overall levels of behavioral accuracy, often neglecting perceptual decisions' rich, dynamic nature. Here, we introduce a novel computational framework to model the dynamics of human behavioral choices by learning to align the temporal dynamics of a recurrent neural network (RNN) to human reaction times (RTs). We describe an approximation that allows us to constrain the number of time steps an RNN takes to solve a task with human RTs. The approach is extensively evaluated against various psychophysics experiments. We also show that the approximation can be used to optimize an "ideal-observer" RNN model to achieve an optimal tradeoff between speed and accuracy without human data. The resulting model is found to account well for human RT data. Finally, we use the approximation to train a deep learning implementation of the popular Wong-Wang decision-making model. The model is integrated with a convolutional neural network (CNN) model of visual processing and evaluated using both artificial and natural image stimuli. Overall, we present a novel framework that helps align current vision models with human behavior, bringing us closer to an integrated model of human vision.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes