Self-supervised representation learning from electroencephalography signals
This work addresses the challenge of costly data labeling in EEG analysis, particularly for sleep scoring, but it is incremental as it adapts existing self-supervised techniques to a specific domain.
The paper tackled the problem of learning representations from electroencephalography (EEG) signals without labeled data by using self-supervised strategies, such as predicting if time windows are from the same context, and demonstrated that this approach outperforms purely supervised methods in low-data regimes for sleep scoring tasks on two datasets.
The supervised learning paradigm is limited by the cost - and sometimes the impracticality - of data collection and labeling in multiple domains. Self-supervised learning, a paradigm which exploits the structure of unlabeled data to create learning problems that can be solved with standard supervised approaches, has shown great promise as a pretraining or feature learning approach in fields like computer vision and time series processing. In this work, we present self-supervision strategies that can be used to learn informative representations from multivariate time series. One successful approach relies on predicting whether time windows are sampled from the same temporal context or not. As demonstrated on a clinically relevant task (sleep scoring) and with two electroencephalography datasets, our approach outperforms a purely supervised approach in low data regimes, while capturing important physiological information without any access to labels.