SYSYApr 18

Online Reinforcement Learning for Safe Gain Scheduling in Nonlinear Quadrotor Control

arXiv:2604.1681919.2h-index: 5
AI Analysis

For quadrotor control practitioners, this work provides a method to adapt controller gains online while ensuring safety, though it is incremental as it combines existing RL and control techniques.

This paper introduces an online reinforcement-learning framework for safe gain scheduling in nonlinear quadrotor control, using a deep Q-network to select from a library of pre-certified stabilizing controllers. Simulations demonstrate accurate trajectory tracking, bounded attitude, reduced control effort near convergence, and stable hover regulation.

This paper presents an online reinforcement-learning framework for safe gain scheduling of a nonlinear quadcopter controller. Rather than learning thrust and torque commands directly, the proposed method selects gain vectors online from a finite library of pre-certified stabilizing controllers, thereby preserving the structure of the underlying snap-based control law. Safety is enforced by restricting the policy to admissible gains that maintain forward invariance of a prescribed safe state set, while dwell-time constraints prevent excessively fast switching. To reduce the action-space dimension, translational gains are shared across spatial axes by exploiting the isotropic structure of the translational dynamics, whereas yaw gains are scheduled independently. A deep Q-network learns to adjust feedback authority according to the current flight condition, using aggressive gains during large transients and milder gains near hover. High-fidelity nonlinear simulations demonstrate accurate trajectory tracking, bounded attitude motion, reduced control effort near convergence, and stable hover regulation under online safe gain scheduling.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes