LGAIOct 5, 2022

Transformer-based conditional generative adversarial network for multivariate time series generation

arXiv:2210.02089v117 citationsh-index: 17Has Code
Originality Incremental advance
AI Analysis

This work addresses a specific limitation in time series generation for data augmentation and simulation, but it is incremental as it builds directly on prior TTS-GAN methods.

The authors tackled the problem of generating realistic multivariate time series from mixture distributions by extending a Transformer-based GAN to condition on encoded contexts, enabling one model to handle multiple sub-components. They demonstrated its effectiveness on a human activity dataset, showing it can generate high-dimensional sequences under various conditions with improved statistical similarity.

Conditional generation of time-dependent data is a task that has much interest, whether for data augmentation, scenario simulation, completing missing data, or other purposes. Recent works proposed a Transformer-based Time series generative adversarial network (TTS-GAN) to address the limitations of recurrent neural networks. However, this model assumes a unimodal distribution and tries to generate samples around the expectation of the real data distribution. One of its limitations is that it may generate a random multivariate time series; it may fail to generate samples in the presence of multiple sub-components within an overall distribution. One could train models to fit each sub-component separately to overcome this limitation. Our work extends the TTS-GAN by conditioning its generated output on a particular encoded context allowing the use of one model to fit a mixture distribution with multiple sub-components. Technically, it is a conditional generative adversarial network that models realistic multivariate time series under different types of conditions, such as categorical variables or multivariate time series. We evaluate our model on UniMiB Dataset, which contains acceleration data following the XYZ axes of human activities collected using Smartphones. We use qualitative evaluations and quantitative metrics such as Principal Component Analysis (PCA), and we introduce a modified version of the Frechet inception distance (FID) to measure the performance of our model and the statistical similarities between the generated and the real data distributions. We show that this transformer-based CGAN can generate realistic high-dimensional and long data sequences under different kinds of conditions.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes