LGNCJun 18, 2021

Deep Reinforcement Learning Models Predict Visual Responses in the Brain: A Preliminary Result

arXiv:2106.10112v12 citations
Originality Incremental advance
AI Analysis

This work addresses the gap in visual neuroscience by incorporating embodied cognition, offering a potential new direction for modeling object recognition, though it is incremental as it builds on existing supervised approaches.

The study investigated whether deep reinforcement learning models, trained by interacting with a 3D game, could better explain object recognition in the primate visual system compared to supervised models. It found that reinforcement learning models achieved comparable neural prediction accuracy in early visual areas like V1 and V2, while supervised models performed better in higher visual areas.

Supervised deep convolutional neural networks (DCNNs) are currently one of the best computational models that can explain how the primate ventral visual stream solves object recognition. However, embodied cognition has not been considered in the existing visual processing models. From the ecological standpoint, humans learn to recognize objects by interacting with them, allowing better classification, specialization, and generalization. Here, we ask if computational models under the embodied learning framework can explain mechanisms underlying object recognition in the primate visual system better than the existing supervised models? To address this question, we use reinforcement learning to train neural network models to play a 3D computer game and we find that these reinforcement learning models achieve neural response prediction accuracy scores in the early visual areas (e.g., V1 and V2) in the levels that are comparable to those accomplished by the supervised neural network model. In contrast, the supervised neural network models yield better neural response predictions in the higher visual areas, compared to the reinforcement learning models. Our preliminary results suggest the future direction of visual neuroscience in which deep reinforcement learning should be included to fill the missing embodiment concept.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes