SYSYApr 14, 2019

Optimal control of batch processes via a deterministic Q-learning method

arXiv:1904.036541 citations
Originality Synthesis-oriented
AI Analysis

For chemical engineers operating batch reactors, this work offers a more robust online optimization approach that can handle sudden process changes, though it is tested only on simple models.

This paper proposes a deterministic Q-learning method for robust online optimization of batch reactors, enabling control actions to adapt to unplanned changes during the process. The method is tested on simple batch reactor models and compared to other dynamic optimization methods, showing effectiveness in maximizing final product or meeting product specifications.

Dynamic optimization of nonlinear chemical systems -- such as batch reactors -- should be applied online, and the suitable control taken should be according to the current state of the system rather than the current time instant. The recent state of the art methods applies the control based on the current time instant only. This is not suitable for most cases, as it is not robust to possible changes in the system. This paper proposes a Deterministic Q-Learning method to conduct robust online optimization of batch reactors. In this paper, the Q-Learning method is applied on simple batch reactor models; and in order to show the effectiveness of the proposed method the results are compared to other dynamic optimization methods. The main advantage of the Q-learning method or the proposed method is that it can accommodate unplanned changes during the process via changing the control action; i.e. the main advantage of the proposed method that it can overcome sudden changes during the reaction. In general, we try to maximize the final product obtained or meet certain specifications of the products (e.g. minimize side products).

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes