CVAISPDec 13, 2024

CognitionCapturer: Decoding Visual Stimuli From Human EEG Signal With Multimodal Information

arXiv:2412.10489v240 citationsh-index: 23Has CodeAAAI
Originality Incremental advance
AI Analysis

This addresses the challenge of improving EEG-based brain-computer interfaces for applications like neuroimaging or assistive technologies, though it is incremental by building on existing multimodal and generative methods.

The paper tackles the problem of decoding visual stimuli from EEG signals by leveraging multimodal information beyond just image data, achieving state-of-the-art performance with high semantic and structural fidelity in reconstructions.

Electroencephalogram (EEG) signals have attracted significant attention from researchers due to their non-invasive nature and high temporal sensitivity in decoding visual stimuli. However, most recent studies have focused solely on the relationship between EEG and image data pairs, neglecting the valuable ``beyond-image-modality" information embedded in EEG signals. This results in the loss of critical multimodal information in EEG. To address this limitation, we propose CognitionCapturer, a unified framework that fully leverages multimodal data to represent EEG signals. Specifically, CognitionCapturer trains Modality Expert Encoders for each modality to extract cross-modal information from the EEG modality. Then, it introduces a diffusion prior to map the EEG embedding space to the CLIP embedding space, followed by using a pretrained generative model, the proposed framework can reconstruct visual stimuli with high semantic and structural fidelity. Notably, the framework does not require any fine-tuning of the generative models and can be extended to incorporate more modalities. Through extensive experiments, we demonstrate that CognitionCapturer outperforms state-of-the-art methods both qualitatively and quantitatively. Code: https://github.com/XiaoZhangYES/CognitionCapturer.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes