CV HCJul 16, 2025

Deep Neural Encoder-Decoder Model to Relate fMRI Brain Activity with Naturalistic Stimuli

Florian David, Michael Chan, Elenor Morgenroth, Patrik Vuilleumier, Dimitri Van De Ville

arXiv:2507.12009v13.6h-index: 52EMBC

Originality Incremental advance

AI Analysis

This work addresses the challenge of bridging temporal gaps in brain imaging for neuroscience researchers, though it appears incremental as it builds on existing deep learning approaches for fMRI analysis.

The authors tackled the problem of relating fMRI brain activity to naturalistic visual stimuli by developing a deep neural encoder-decoder model that predicts voxel activity and reconstructs visual inputs, finding that key visual regions like the middle occipital area contribute to decoding edges, faces, and contrasts.

We propose an end-to-end deep neural encoder-decoder model to encode and decode brain activity in response to naturalistic stimuli using functional magnetic resonance imaging (fMRI) data. Leveraging temporally correlated input from consecutive film frames, we employ temporal convolutional layers in our architecture, which effectively allows to bridge the temporal resolution gap between natural movie stimuli and fMRI acquisitions. Our model predicts activity of voxels in and around the visual cortex and performs reconstruction of corresponding visual inputs from neural activity. Finally, we investigate brain regions contributing to visual decoding through saliency maps. We find that the most contributing regions are the middle occipital area, the fusiform area, and the calcarine, respectively employed in shape perception, complex recognition (in particular face perception), and basic visual features such as edges and contrasts. These functions being strongly solicited are in line with the decoder's capability to reconstruct edges, faces, and contrasts. All in all, this suggests the possibility to probe our understanding of visual processing in films using as a proxy the behaviour of deep learning models such as the one proposed in this paper.

View on arXiv PDF

Similar