SYLGApr 13, 2024

Active Learning for Control-Oriented Identification of Nonlinear Systems

arXiv:2404.09030v213 citationsh-index: 10CDC
AI Analysis

This work addresses the challenge of minimizing costly experimentation for controlling unknown nonlinear systems, representing a significant advancement over prior methods limited to linear models.

The paper tackles the problem of sample-efficient model-based reinforcement learning for nonlinear systems by introducing an active learning algorithm with finite sample analysis, achieving optimal excess control cost rates in certain settings.

Model-based reinforcement learning is an effective approach for controlling an unknown system. It is based on a longstanding pipeline familiar to the control community in which one performs experiments on the environment to collect a dataset, uses the resulting dataset to identify a model of the system, and finally performs control synthesis using the identified model. As interacting with the system may be costly and time consuming, targeted exploration is crucial for developing an effective control-oriented model with minimal experimentation. Motivated by this challenge, recent work has begun to study finite sample data requirements and sample efficient algorithms for the problem of optimal exploration in model-based reinforcement learning. However, existing theory and algorithms are limited to model classes which are linear in the parameters. Our work instead focuses on models with nonlinear parameter dependencies, and presents the first finite sample analysis of an active learning algorithm suitable for a general class of nonlinear dynamics. In certain settings, the excess control cost of our algorithm achieves the optimal rate, up to logarithmic factors. We validate our approach in simulation, showcasing the advantage of active, control-oriented exploration for controlling nonlinear systems.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes