CVOct 25, 2021

MoDeRNN: Towards Fine-grained Motion Details for Spatiotemporal Predictive Learning

arXiv:2110.12978v25 citations
Originality Incremental advance
AI Analysis

This work addresses the problem of improving prediction quality in spatiotemporal predictive learning for applications like weather forecasting, but it is incremental as it builds on existing ConvLSTM methods.

The paper tackles the challenge of predicting subsequent frames in spatiotemporal sequences by enhancing correspondence between context and current state, resulting in MoDeRNN outperforming state-of-the-art methods on Moving MNIST and Typhoon datasets with lower computation loads.

Spatiotemporal predictive learning (ST-PL) aims at predicting the subsequent frames via limited observed sequences, and it has broad applications in the real world. However, learning representative spatiotemporal features for prediction is challenging. Moreover, chaotic uncertainty among consecutive frames exacerbates the difficulty in long-term prediction. This paper concentrates on improving prediction quality by enhancing the correspondence between the previous context and the current state. We carefully design Detail Context Block (DCB) to extract fine-grained details and improve the isolated correlation between upper context state and current input state. We integrate DCB with standard ConvLSTM and introduce Motion Details RNN (MoDeRNN) to capture fine-grained spatiotemporal features and improve the expression of latent states of RNNs to achieve significant quality. Experiments on Moving MNIST and Typhoon datasets demonstrate the effectiveness of the proposed method. MoDeRNN outperforms existing state-of-the-art techniques qualitatively and quantitatively with lower computation loads.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes