Fast Spectrogram Event Extraction via Offline Self-Supervised Learning: From Fusion Diagnostics to Bioacoustics

arXiv:2602.20317v1h-index: 7Has Code
Originality Incremental advance
AI Analysis

This addresses the data deluge problem in fusion facilities like ITER, enabling automated analysis for plasma control, though it is incremental as it builds on existing signal processing techniques.

The paper tackles the challenge of analyzing large volumes of noisy time-frequency data from fusion diagnostics by developing a self-supervised framework for automated extraction of coherent and transient modes, achieving an inference latency of 0.5 seconds for real-time mode identification.

Next-generation fusion facilities like ITER face a "data deluge," generating petabytes of multi-diagnostic signals daily that challenge manual analysis. We present a "signals-first" self-supervised framework for the automated extraction of coherent and transient modes from high-noise time-frequency data. We also develop a general-purpose method and tool for extracting coherent, quasi-coherent, and transient modes for fluctuation measurements in tokamaks by employing non-linear optimal techniques in multichannel signal processing with a fast neural network surrogate on fast magnetics, electron cyclotron emission, CO2 interferometers, and beam emission spectroscopy measurements from DIII-D. Results are tested on data from DIII-D, TJ-II, and non-fusion spectrograms. With an inference latency of 0.5 seconds, this framework enables real-time mode identification and large-scale automated database generation for advanced plasma control. Repository is in https://github.com/PlasmaControl/TokEye.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes