SP AI LGJan 3, 2023

Unsupervised Multivariate Time-Series Transformers for Seizure Identification on EEG

İlkay Yıldız Potter, George Zerveas, Carsten Eickhoff, Dominique Duncan

Microsoft

arXiv:2301.03470v14.332 citationsh-index: 16Has Code

Originality Incremental advance

AI Analysis

This addresses the problem of expensive expert labeling for epilepsy monitoring, offering an unsupervised approach that could facilitate widely accessible detection, though it appears incremental as it adapts existing transformer methods to a specific domain.

The authors tackled automated seizure identification from EEG recordings by framing it as an unsupervised anomaly detection problem, using a transformer-based autoencoder with a novel masking strategy for multivariate time-series data. Their method achieved significantly better performance than supervised learning counterparts, with improvements of up to 16% recall, 9% accuracy, and 9% AUC on benchmark datasets.

Epilepsy is one of the most common neurological disorders, typically observed via seizure episodes. Epileptic seizures are commonly monitored through electroencephalogram (EEG) recordings due to their routine and low expense collection. The stochastic nature of EEG makes seizure identification via manual inspections performed by highly-trained experts a tedious endeavor, motivating the use of automated identification. The literature on automated identification focuses mostly on supervised learning methods requiring expert labels of EEG segments that contain seizures, which are difficult to obtain. Motivated by these observations, we pose seizure identification as an unsupervised anomaly detection problem. To this end, we employ the first unsupervised transformer-based model for seizure identification on raw EEG. We train an autoencoder involving a transformer encoder via an unsupervised loss function, incorporating a novel masking strategy uniquely designed for multivariate time-series data such as EEG. Training employs EEG recordings that do not contain any seizures, while seizures are identified with respect to reconstruction errors at inference time. We evaluate our method on three publicly available benchmark EEG datasets for distinguishing seizure vs. non-seizure windows. Our method leads to significantly better seizure identification performance than supervised learning counterparts, by up to 16% recall, 9% accuracy, and 9% Area under the Receiver Operating Characteristics Curve (AUC), establishing particular benefits on highly imbalanced data. Through accurate seizure identification, our method could facilitate widely accessible and early detection of epilepsy development, without needing expensive label collection or manual feature extraction.

View on arXiv PDF Code

Similar