IVAICVOct 25, 2024

NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction

arXiv:2410.19452v333 citationsh-index: 9Has CodeNIPS
Originality Incremental advance
AI Analysis

This addresses the limited research on fMRI-to-video reconstruction, which could benefit neuroscience and brain-computer interface applications, though it appears incremental as it builds on existing CLIP and diffusion models.

The paper tackles the challenging problem of reconstructing smooth, high-fidelity videos from fMRI brain activity data, achieving significant improvements over state-of-the-art models with a 128% increase in SSIM and 81% improvement in spatiotemporal metrics for up to 6-second videos at 8FPS.

Reconstruction of static visual stimuli from non-invasion brain activity fMRI achieves great success, owning to advanced deep learning models such as CLIP and Stable Diffusion. However, the research on fMRI-to-video reconstruction remains limited since decoding the spatiotemporal perception of continuous visual experiences is formidably challenging. We contend that the key to addressing these challenges lies in accurately decoding both high-level semantics and low-level perception flows, as perceived by the brain in response to video stimuli. To the end, we propose NeuroClips, an innovative framework to decode high-fidelity and smooth video from fMRI. NeuroClips utilizes a semantics reconstructor to reconstruct video keyframes, guiding semantic accuracy and consistency, and employs a perception reconstructor to capture low-level perceptual details, ensuring video smoothness. During inference, it adopts a pre-trained T2V diffusion model injected with both keyframes and low-level perception flows for video reconstruction. Evaluated on a publicly available fMRI-video dataset, NeuroClips achieves smooth high-fidelity video reconstruction of up to 6s at 8FPS, gaining significant improvements over state-of-the-art models in various metrics, e.g., a 128% improvement in SSIM and an 81% improvement in spatiotemporal metrics. Our project is available at https://github.com/gongzix/NeuroClips.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes