SYLGJun 2, 2025

Data-assimilated model-informed reinforcement learning

arXiv:2506.01755v31 citationsh-index: 3Proceedings of the Royal Society A Mathematical Physical and Engineering Science
Originality Incremental advance
AI Analysis

This addresses the problem of controlling high-dimensional chaotic systems for applications like fluid dynamics or engineering, though it is incremental as it builds on existing RL and data assimilation methods.

The paper tackles controlling chaotic systems with partial and noisy observations by developing DA-MIRL, a framework integrating low-order models, data assimilation, and reinforcement learning, which successfully estimates and suppresses chaotic dynamics in the Kuramoto-Sivashinsky equation in real-time.

The control of spatio-temporally chaos is challenging because of high dimensionality and unpredictability. Model-free reinforcement learning (RL) discovers optimal control policies by interacting with the system, typically requiring observations of the full physical state. In practice, sensors often provide only partial and noisy measurements (observations) of the system. The objective of this paper is to develop a framework that enables the control of chaotic systems with partial and noisy observability. The proposed method, data-assimilated model-informed reinforcement learning (DA-MIRL), integrates (i) low-order models to approximate high-dimensional dynamics; (ii) sequential data assimilation to correct the model prediction when observations become available; and (iii) an off-policy actor-critic RL algorithm to adaptively learn an optimal control strategy based on the corrected state estimates. We test DA-MIRL on the spatiotemporally chaotic solutions of the Kuramoto-Sivashinsky equation. We estimate the full state of the environment with (i) a physics-based model, here, a coarse-grained model; and (ii) a data-driven model, here, the control-aware echo state network, which is proposed in this paper. We show that DA-MIRL successfully estimates and suppresses the chaotic dynamics of the environment in real time from partial observations and approximate models. This work opens opportunities for the control of partially observable chaotic systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes