SP LGJun 14, 2023

Data Augmentation for Seizure Prediction with Generative Diffusion Model

Kai Shu, Le Wu, Yuchang Zhao, Aiping Liu, Ruobing Qian, Xun Chen

arXiv:2306.08256v211.334 citationsh-index: 42

Originality Incremental advance

AI Analysis

This work addresses data scarcity for seizure prediction in medical applications, representing an incremental improvement over existing linear transformation methods.

The paper tackles the problem of limited data diversity in EEG-based seizure prediction by proposing a novel diffusion-based data augmentation method called DiffEEG, which generates diverse synthetic samples to improve classifier performance, resulting in state-of-the-art metrics such as 95.4% sensitivity and 0.932 AUC on the CHB-MIT database.

Data augmentation (DA) can significantly strengthen the electroencephalogram (EEG)-based seizure prediction methods. However, existing DA approaches are just the linear transformations of original data and cannot explore the feature space to increase diversity effectively. Therefore, we propose a novel diffusion-based DA method called DiffEEG. DiffEEG can fully explore data distribution and generate samples with high diversity, offering extra information to classifiers. It involves two processes: the diffusion process and the denoised process. In the diffusion process, the model incrementally adds noise with different scales to EEG input and converts it into random noise. In this way, the representation of data can be learned. In the denoised process, the model utilizes learned knowledge to sample synthetic data from random noise input by gradually removing noise. The randomness of input noise and the precise representation enable the synthetic samples to possess diversity while ensuring the consistency of feature space. We compared DiffEEG with original, down-sampling, sliding windows and recombination methods, and integrated them into five representative classifiers. The experiments demonstrate the effectiveness and generality of our method. With the contribution of DiffEEG, the Multi-scale CNN achieves state-of-the-art performance, with an average sensitivity, FPR, AUC of 95.4%, 0.051/h, 0.932 on the CHB-MIT database and 93.6%, 0.121/h, 0.822 on the Kaggle database.

View on arXiv PDF

Similar