NALGDec 18, 2023

Volume-Preserving Transformers for Learning Time Series Data with Structure

arXiv:2312.11166v42 citationsh-index: 2ESAIM Proc Surv
Originality Incremental advance
AI Analysis

This work addresses the challenge of applying transformers to physical systems, offering a domain-specific improvement for time series data with inherent structure.

The authors tackled the problem of learning dynamical systems with transformers by modifying the attention activation to preserve structure, resulting in improved long-term stability for rigid body trajectory learning.

Two of the many trends in neural network research of the past few years have been (i) the learning of dynamical systems, especially with recurrent neural networks such as long short-term memory networks (LSTMs) and (ii) the introduction of transformer neural networks for natural language processing (NLP) tasks. While some work has been performed on the intersection of these two trends, those efforts were largely limited to using the vanilla transformer directly without adjusting its architecture for the setting of a physical system. In this work we develop a transformer-inspired neural network and use it to learn a dynamical system. We (for the first time) change the activation function of the attention layer to imbue the transformer with structure-preserving properties to improve long-term stability. This is shown to be of great advantage when applying the neural network to learning the trajectory of a rigid body.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes