LGNEAug 5, 2025

Minimal Convolutional RNNs Accelerate Spatiotemporal Learning

arXiv:2508.03614v1h-index: 44ICANN
Originality Incremental advance
AI Analysis

This work addresses the training efficiency bottleneck in spatiotemporal forecasting for domains like fluid dynamics and geoscience, offering an incremental improvement over existing ConvRNN methods.

The authors tackled the problem of slow training in spatiotemporal models by introducing MinConvLSTM and MinConvGRU, which combine convolutional recurrent networks with minimal, parallelizable RNNs, resulting in significantly faster training speeds and lower prediction errors on tasks like Navier-Stokes dynamics and geopotential data.

We introduce MinConvLSTM and MinConvGRU, two novel spatiotemporal models that combine the spatial inductive biases of convolutional recurrent networks with the training efficiency of minimal, parallelizable RNNs. Our approach extends the log-domain prefix-sum formulation of MinLSTM and MinGRU to convolutional architectures, enabling fully parallel training while retaining localized spatial modeling. This eliminates the need for sequential hidden state updates during teacher forcing - a major bottleneck in conventional ConvRNN models. In addition, we incorporate an exponential gating mechanism inspired by the xLSTM architecture into the MinConvLSTM, which further simplifies the log-domain computation. Our models are structurally minimal and computationally efficient, with reduced parameter count and improved scalability. We evaluate our models on two spatiotemporal forecasting tasks: Navier-Stokes dynamics and real-world geopotential data. In terms of training speed, our architectures significantly outperform standard ConvLSTMs and ConvGRUs. Moreover, our models also achieve lower prediction errors in both domains, even in closed-loop autoregressive mode. These findings demonstrate that minimal recurrent structures, when combined with convolutional input aggregation, offer a compelling and efficient alternative for spatiotemporal sequence modeling, bridging the gap between recurrent simplicity and spatial complexity.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes