LG AI ACC-PHJun 6, 2023

Learning to Do or Learning While Doing: Reinforcement Learning and Bayesian Optimisation for Online Continuous Tuning

Jan Kaiser, Chenran Xu, Annika Eichler, Andrea Santamaria Garcia, Oliver Stein, Erik Bründermann, Willi Kuropka, Hannes Dinter, Frank Mayet, Thomas Vinatier, Florian Burkart, Holger Schlarb

arXiv:2306.03739v13.86 citationsh-index: 52Has Code

Originality Incremental advance

AI Analysis

This work addresses the challenge of reducing manual intervention and improving efficiency in tuning complex real-world facilities like particle accelerators, providing incremental guidance for algorithm selection.

The study tackled the problem of autonomous tuning for real-world plants by comparing Reinforcement Learning-trained Optimization (RLO) and Bayesian Optimization (BO) in a particle accelerator task, finding that RLO generally outperforms BO but is not always optimal, with RLO achieving up to 30% faster tuning times in some scenarios.

Online tuning of real-world plants is a complex optimisation problem that continues to require manual intervention by experienced human operators. Autonomous tuning is a rapidly expanding field of research, where learning-based methods, such as Reinforcement Learning-trained Optimisation (RLO) and Bayesian optimisation (BO), hold great promise for achieving outstanding plant performance and reducing tuning times. Which algorithm to choose in different scenarios, however, remains an open question. Here we present a comparative study using a routine task in a real particle accelerator as an example, showing that RLO generally outperforms BO, but is not always the best choice. Based on the study's results, we provide a clear set of criteria to guide the choice of algorithm for a given tuning task. These can ease the adoption of learning-based autonomous tuning solutions to the operation of complex real-world plants, ultimately improving the availability and pushing the limits of operability of these facilities, thereby enabling scientific and engineering advancements.

View on arXiv PDF Code

Similar