LGFeb 12

Self-Supervised Learning via Flow-Guided Neural Operator on Time-Series Data

Duy Nguyen, Jiachen Yao, Jiayun Wang, Julius Berner, Animashree Anandkumar

arXiv:2602.12267v21.4h-index: 7

Originality Highly original

AI Analysis

This addresses the need for more flexible and robust self-supervised learning methods for biomedical time-series data, particularly in low-data regimes.

The paper tackles the problem of static corruption levels in self-supervised learning for time-series data by proposing a flow-guided neural operator framework that treats corruption level as a degree of freedom, resulting in up to 35% AUROC gains in neural signal decoding, 16% RMSE reductions in temperature prediction, and over 20% accuracy improvements in sleep data.

Self-supervised learning (SSL) is a powerful paradigm for learning from unlabeled time-series data. However, popular methods such as masked autoencoders (MAEs) rely on reconstructing inputs from a fixed, predetermined masking ratio. Instead of this static design, we propose treating the corruption level as a new degree of freedom for representation learning, enhancing flexibility and performance. To achieve this, we introduce the Flow-Guided Neural Operator (FGNO), a novel framework combining operator learning with flow matching for SSL training. FGNO learns mappings in functional spaces by using Short-Time Fourier Transform to unify different time resolutions. We extract a rich hierarchy of features by tapping into different network layers and flow times that apply varying strengths of noise to the input data. This enables the extraction of versatile representations, from low-level patterns to high-level global features, using a single model adaptable to specific tasks. Unlike prior generative SSL methods that use noisy inputs during inference, we propose using clean inputs for representation extraction while learning representations with noise; this eliminates randomness and boosts accuracy. We evaluate FGNO across three biomedical domains, where it consistently outperforms established baselines. Our method yields up to 35% AUROC gains in neural signal decoding (BrainTreeBank), 16% RMSE reductions in skin temperature prediction (DREAMT), and over 20% improvement in accuracy and macro-F1 on SleepEDF under low-data regimes. These results highlight FGNO's robustness to data scarcity and its superior capacity to learn expressive representations for diverse time series.

View on arXiv PDF

Similar