LGOct 13, 2025

WaveletDiff: Multilevel Wavelet Diffusion For Time Series Generation

arXiv:2510.11839v17.11 citationsh-index: 2

Originality Highly original

AI Analysis

This addresses the scarcity of high-quality time series datasets for applications like healthcare and finance, representing a novel method rather than an incremental improvement.

The paper tackles the problem of generating synthetic time series data by introducing WaveletDiff, a diffusion model trained on wavelet coefficients to capture multi-scaled structures, achieving discriminative and Context-FID scores that are 3 times smaller on average than the second-best baseline across six datasets.

Time series are ubiquitous in many applications that involve forecasting, classification and causal inference tasks, such as healthcare, finance, audio signal processing and climate sciences. Still, large, high-quality time series datasets remain scarce. Synthetic generation can address this limitation; however, current models confined either to the time or frequency domains struggle to reproduce the inherently multi-scaled structure of real-world time series. We introduce WaveletDiff, a novel framework that trains diffusion models directly on wavelet coefficients to exploit the inherent multi-resolution structure of time series data. The model combines dedicated transformers for each decomposition level with cross-level attention mechanisms that enable selective information exchange between temporal and frequency scales through adaptive gating. It also incorporates energy preservation constraints for individual levels based on Parseval's theorem to preserve spectral fidelity throughout the diffusion process. Comprehensive tests across six real-world datasets from energy, finance, and neuroscience domains demonstrate that WaveletDiff consistently outperforms state-of-the-art time-domain and frequency-domain generative methods on both short and long time series across five diverse performance metrics. For example, WaveletDiff achieves discriminative scores and Context-FID scores that are $3\times$ smaller on average than the second-best baseline across all datasets.

View on arXiv PDF

Similar