LGOct 3, 2023

DON-LSTM: Multi-Resolution Learning with DeepONets and Long Short-Term Memory Neural Networks

arXiv:2310.02491v12 citationsh-index: 142
AI Analysis

This work addresses the challenge of leveraging multi-resolution data for improved sequence modeling in scientific computing, representing an incremental advancement in neural network architectures.

The paper tackled the problem of modeling long-time evolution in non-linear systems by proposing DON-LSTM, a novel architecture combining DeepONets and LSTMs, which achieved significantly lower generalization error and required fewer high-resolution samples compared to baseline methods.

Deep operator networks (DeepONets, DONs) offer a distinct advantage over traditional neural networks in their ability to be trained on multi-resolution data. This property becomes especially relevant in real-world scenarios where high-resolution measurements are difficult to obtain, while low-resolution data is more readily available. Nevertheless, DeepONets alone often struggle to capture and maintain dependencies over long sequences compared to other state-of-the-art algorithms. We propose a novel architecture, named DON-LSTM, which extends the DeepONet with a long short-term memory network (LSTM). Combining these two architectures, we equip the network with explicit mechanisms to leverage multi-resolution data, as well as capture temporal dependencies in long sequences. We test our method on long-time-evolution modeling of multiple non-linear systems and show that the proposed multi-resolution DON-LSTM achieves significantly lower generalization error and requires fewer high-resolution samples compared to its vanilla counterparts.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes