Lixian Zhang

h-index: 71

5 papers

327 citations

Novelty: 52%

AI Score: 32

Ranked #124,278 of 194,257 authors (top 64%); #3,692 in RO (top 55%)

5 Papers

1.2 | SY | Jan 11, 2017

Cloud-Aided State Estimation of A Full-Car Semi-Active Suspension System

Lixian Zhang, Xunyuan Yin, Junnan Shen et al.

In this work, we investigate a state estimation problem for a full-car semi-active suspension system. To account for the complex calculation and optimization problems, a vehicle-to-cloud-to-vehicle (V2C2V) scheme is utilized. Moving horizon estimation is introduced for the state estimation system design. All the optimization problems are solved in a remotely-embedded agent with high computational ability. Measurements and state estimates are transmitted between the vehicle and the remote agent via networked communication channels. The effectiveness of the proposed method is illustrated via a set of simulations.
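To make the idea of moving horizon estimation concrete, here is a minimal sketch, not the paper's method. It assumes a linear, noise-free-dynamics model `x[k+1] = A x[k]`, `y[k] = C x[k]`, with no constraints, so each window reduces to a least-squares problem. The function name `mhe_window_estimate`, the `prior_weight` arrival-cost term, and the example matrices are all hypothetical.

```python
import numpy as np

def mhe_window_estimate(A, C, ys, x_prior, prior_weight=1e-3):
    """Least-squares state estimate over one measurement window.

    Assumed model (illustrative only): x[k+1] = A x[k], y[k] = C x[k] + noise.
    Returns the estimated state at the start and at the end of the window.
    """
    n = A.shape[0]
    rows, rhs = [], []
    Ak = np.eye(n)  # A^k, starting with k = 0
    for y in ys:
        rows.append(C @ Ak)          # y[k] is predicted by C A^k x0
        rhs.append(np.atleast_1d(y))
        Ak = A @ Ak
    # Arrival cost: pull the window's initial state toward the prior estimate.
    w = np.sqrt(prior_weight)
    rows.append(w * np.eye(n))
    rhs.append(w * np.asarray(x_prior, dtype=float))
    H = np.vstack(rows)
    b = np.concatenate(rhs)
    x0, *_ = np.linalg.lstsq(H, b, rcond=None)
    # Propagate the window's initial state to the time of the last measurement.
    x_end = np.linalg.matrix_power(A, len(ys) - 1) @ x0
    return x0, x_end
```

In a V2C2V-style setup, the vehicle would send the window `ys` to the remote agent, which would solve this problem and return `x_end`; in a full implementation the window then slides forward by one sample and `x_prior` is updated from the previous solution.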

10.3 | LG | Nov 7, 2019 | Code

$H_\infty$ Model-free Reinforcement Learning with Robust Stability Guarantee

Minghao Han, Yuan Tian, Lixian Zhang et al.

Reinforcement learning is showing great potential in robotics applications, including autonomous driving, robot manipulation and locomotion. However, with complex uncertainties in the real-world environment, it is difficult to guarantee the successful generalization and sim-to-real transfer of learned policies theoretically. In this paper, we introduce and extend the idea of robust stability and $H_\infty$ control to design policies with both stability and robustness guarantees. Specifically, a sample-based approach for analyzing the Lyapunov stability and performance robustness of a learning-based control system is proposed. Based on the theoretical results, a maximum entropy algorithm is developed for searching for a Lyapunov function and designing a policy with a provable robust stability guarantee. Without any specific domain knowledge, our method can find a policy that is robust to various uncertainties and generalizes well to different test environments. In our experiments, we show that our method achieves better robustness to both large impulsive disturbances and parametric variations in the environment than the state-of-the-art results in both robust and generic RL, as well as classic control. Anonymous code is available to reproduce the experimental results at https://github.com/RobustStabilityGuaranteeRL/RobustStabilityGuaranteeRL.

8.9 | RO | Sep 10, 2021 | Code

Autonomous and Adaptive Navigation for Terrestrial-Aerial Bimodal Vehicles

Ruibin Zhang, Yuze Wu, Lixian Zhang et al.

Terrestrial-aerial bimodal vehicles bloom in both academia and industry because they incorporate both the high mobility of aerial vehicles and the long endurance of ground vehicles. In this work, we present an autonomous and adaptive navigation framework to bring complete autonomy to this class of vehicles. The framework mainly includes 1) a hierarchical motion planner that generates safe and low-power terrestrial-aerial trajectories in unknown environments and 2) a unified motion controller which dynamically adjusts energy consumption in terrestrial locomotion. Extensive real-world experiments and benchmark comparisons are conducted on a customized robot platform to validate the proposed framework's robustness and performance. During the tests, the robot safely traverses complex environments with terrestrial-aerial integrated mobility, and achieves $7\times$ energy savings in terrestrial locomotion. Finally, we will release our code and hardware configuration for the reference of the community.

9.7 | SY | Nov 13, 2020

Reinforcement Learning Control of Constrained Dynamic Systems with Uniformly Ultimate Boundedness Stability Guarantee

Minghao Han, Yuan Tian, Lixian Zhang et al.

Reinforcement learning (RL) is promising for complicated stochastic nonlinear control problems. Without using a mathematical model, an optimal controller can be learned from data evaluated by certain performance criteria through trial-and-error. However, the data-based learning approach is notorious for not guaranteeing stability, which is the most fundamental property for any control system. In this paper, the classic Lyapunov's method is explored to analyze the uniformly ultimate boundedness stability (UUB) solely based on data without using a mathematical model. It is further shown how RL with UUB guarantee can be applied to control dynamic systems with safety constraints. Based on the theoretical results, both off-policy and on-policy learning algorithms are proposed respectively. As a result, optimal controllers can be learned to guarantee UUB of the closed-loop system both at convergence and during learning. The proposed algorithms are evaluated on a series of robotic continuous control tasks with safety constraints. In comparison with the existing RL algorithms, the proposed method can achieve superior performance in terms of maintaining safety. As a qualitative evaluation of stability, our method shows impressive resilience even in the presence of external disturbances.

26.4 | RO | Apr 29, 2020 | Code

Actor-Critic Reinforcement Learning for Control with Stability Guarantee

Minghao Han, Lixian Zhang, Jun Wang et al.

Reinforcement Learning (RL) and its integration with deep learning have achieved impressive performance in various robotic control tasks, ranging from motion planning and navigation to end-to-end visual manipulation. However, stability is not guaranteed in model-free RL by solely using data. From a control-theoretic perspective, stability is the most important property for any control system, since it is closely related to safety, robustness, and reliability of robotic systems. In this paper, we propose an actor-critic RL framework for control which can guarantee closed-loop stability by employing the classic Lyapunov's method in control theory. First, a data-based stability theorem is proposed for stochastic nonlinear systems modeled by a Markov decision process. Then we show that the stability condition could be exploited as the critic in the actor-critic RL to learn a controller/policy. Finally, the effectiveness of our approach is evaluated on several well-known 3-dimensional robot control tasks and a synthetic biology gene network tracking task in three different popular physics simulation platforms. As an empirical evaluation of the advantage of stability, we show that the learned policies can enable the systems to recover to the equilibrium or way-points when interfered by uncertainties such as system parametric variations and external disturbances to a certain extent.
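As a rough illustration of what a data-based Lyapunov condition can look like, the sketch below estimates, from sampled transitions, whether a candidate function decreases on average along trajectories. This is an assumed, simplified form, not the theorem or algorithm from the paper: the function name `lyapunov_decrease_gap`, the margin `alpha`, and the use of a plain sample mean are all hypothetical choices.

```python
import numpy as np

def lyapunov_decrease_gap(L, cost, states, next_states, alpha=0.1):
    """Sample mean of L(s') - L(s) + alpha * cost(s).

    A non-positive value suggests that the candidate function L decreases
    on average along the sampled transitions (an assumed, simplified
    stand-in for a data-based stability condition).
    """
    l_now = np.array([L(s) for s in states])
    l_next = np.array([L(s) for s in next_states])
    c = np.array([cost(s) for s in states])
    return float(np.mean(l_next - l_now + alpha * c))
```

In an actor-critic setting, a quantity of this kind could serve as a constraint or penalty when updating the policy, with `L` represented by a learned critic; how the paper does this exactly is not specified in the abstract.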