LG NE NCSep 9, 2025

ArtifactGen: Benchmarking WGAN-GP vs Diffusion for Label-Aware EEG Artifact Synthesis

arXiv:2509.08188v1h-index: 10

Originality Synthesis-oriented

AI Analysis

This work addresses the costly labeling of EEG artifacts for automated analysis, but it is incremental as it benchmarks existing generative methods on a specific domain dataset.

The study tackled the problem of synthesizing realistic, label-aware EEG artifacts for augmentation and stress-testing, comparing WGAN-GP and diffusion models on the TUAR corpus, with WGAN-GP achieving closer spectral alignment and lower MMD to real data, while both models showed weak class-conditional recovery.

Artifacts in electroencephalography (EEG) -- muscle, eye movement, electrode, chewing, and shiver -- confound automated analysis yet are costly to label at scale. We study whether modern generative models can synthesize realistic, label-aware artifact segments suitable for augmentation and stress-testing. Using the TUH EEG Artifact (TUAR) corpus, we curate subject-wise splits and fixed-length multi-channel windows (e.g., 250 samples) with preprocessing tailored to each model (per-window min-max for adversarial training; per-recording/channel $z$-score for diffusion). We compare a conditional WGAN-GP with a projection discriminator to a 1D denoising diffusion model with classifier-free guidance, and evaluate along three axes: (i) fidelity via Welch band-power deltas ($Δδ,\ Δθ,\ Δα,\ Δβ$), channel-covariance Frobenius distance, autocorrelation $L_2$, and distributional metrics (MMD/PRD); (ii) specificity via class-conditional recovery with lightweight $k$NN/classifiers; and (iii) utility via augmentation effects on artifact recognition. In our setting, WGAN-GP achieves closer spectral alignment and lower MMD to real data, while both models exhibit weak class-conditional recovery, limiting immediate augmentation gains and revealing opportunities for stronger conditioning and coverage. We release a reproducible pipeline -- data manifests, training configurations, and evaluation scripts -- to establish a baseline for EEG artifact synthesis and to surface actionable failure modes for future work.

View on arXiv PDF

Similar