LG AIFeb 27

CAMEL-CLIP: Channel-aware Multimodal Electroencephalography-text Alignment for Generalizable Brain Foundation Models

Hanseul Choi, Jinyeong Park, Seongwon Jin, Sungho Park, Jibum Kim

arXiv:2603.13272h-index: 10

AI Analysis

This addresses the challenge of channel variability in EEG data for researchers and practitioners in brain-computer interfaces and neuroscience, representing an incremental improvement with novel components.

The paper tackled the problem of EEG foundation models being sensitive to channel heterogeneity, proposing CAMEL-CLIP to achieve robust and generalizable EEG-text alignment, resulting in state-of-the-art performance under linear-probing and outperforming existing models that require full-finetuning.

Electroencephalography (EEG) foundation models have shown promise for learning generalizable representations, yet they remain sensitive to channel heterogeneity, such as changes in channel composition or ordering. We propose channel-aware multimodal EEG-text alignment contrastive language-image pretraining (CAMEL-CLIP), a contrastive EEG-text multimodal foundation model designed to be robust to heterogeneous channel configurations and widely applicable to diverse downstream tasks. CAMEL-CLIP introduces three key components: (1) channel attribute-based positional encoding, which identifies channels through semantic information; (2) dynamic channel projection, which generates variable-length embeddings by independently projecting each channel without feature compression; and (3) dual-level contrastive learning, which jointly performs channel-level and sample-level contrastive learning to capture both channel-specific and global signal characteristics. Experimental results demonstrate that CAMEL-CLIP achieves state-of-the-art performance under linear-probing and outperforms existing foundation models that rely on full-finetuning.

View on arXiv PDF

Similar