Ming Cao

SY
h-index4
42papers
1,162citations
Novelty44%
AI Score55

42 Papers

SIMay 26, 2016
Structural balance and opinion separation in trust-mistrust social networks

Weiguo Xia, Ming Cao, Karl Henrik Johansson

Structural balance theory has been developed in sociology and psychology to explain how interacting agents, e.g., countries, political parties, opinionated individuals, with mixed trust and mistrust relationships evolve into polarized camps. Recent results have shown that structural balance is necessary for polarization in networks with fixed, strongly connected neighbor relationships when the opinion dynamics are described by DeGroot-type averaging rules. We develop this line of research in this paper in two steps. First, we consider fixed, not necessarily strongly connected, neighbor relationships. It is shown that if the network includes a strongly connected subnetwork containing mistrust, which influences the rest of the network, then no opinion clustering is possible when that subnetwork is not structurally balanced; all the opinions become neutralized in the end. In contrast, it is shown that when that subnetwork is indeed structurally balanced, the agents of the subnetwork evolve into two polarized camps and the opinions of all other agents in the network spread between these two polarized opinions. Second, we consider time-varying neighbor relationships. We show that the opinion separation criteria carry over if the conditions for fixed graphs are extended to joint graphs. The results are developed for both discrete-time and continuous-time models.

99.4ROMar 15Code
World In Your Hands: A Large-Scale and Open-Source Ecosystem for Learning Human-Centric Manipulation in the Wild

Yupeng Zheng, Jichao Peng, Weize Li et al. · cmu, tsinghua

We introduce World In Your Hands (WIYH), a large-scale open-source ecosystem comprising over 1,000 hours of human manipulation data collected in-the-wild with millimeter-scale motion accuracy. Specifically, WIYH includes (1) the Oracle Suite, a wearable data collection kit with an auto-labeling pipeline for accurate motion capture; (2) the WIYH Dataset, featuring over 1,000 hours of multimodal manipulation data across hundreds of skills in diverse real-world scenarios; and (3) extensive annotations and benchmarks supporting tasks from perception to action. Furthermore, experiments based on the WIYH ecosystem show that integrating WIYH's human-centric data improves robotic manipulation success rates from 8% to 60% in cluttered scenes. World In Your Hands provides a foundation for advancing human-centric data collection and cross-embodiment policy learning. All data and hardware design will be open-source.

CLNov 13, 2022Code
Language Model Classifier Aligns Better with Physician Word Sensitivity than XGBoost on Readmission Prediction

Grace Yang, Ming Cao, Lavender Y. Jiang et al.

Traditional evaluation metrics for classification in natural language processing such as accuracy and area under the curve fail to differentiate between models with different predictive behaviors despite their similar performance metrics. We introduce sensitivity score, a metric that scrutinizes models' behaviors at the vocabulary level to provide insights into disparities in their decision-making logic. We assess the sensitivity score on a set of representative words in the test set using two classifiers trained for hospital readmission classification with similar performance statistics. Our experiments compare the decision-making logic of clinicians and classifiers based on rank correlations of sensitivity scores. The results indicate that the language model's sensitivity score aligns better with the professionals than the xgboost classifier on tf-idf embeddings, which suggests that xgboost uses some spurious features. Overall, this metric offers a novel perspective on assessing models' robustness by quantifying their discrepancy with professional opinions. Our code is available on GitHub (https://github.com/nyuolab/Model_Sensitivity).

SYFeb 12, 2019
Lyapunov Criterion for Stochastic Systems and Its Applications in Distributed Computation

Yuzhen Qin, Ming Cao, Brian D. O. Anderson

This paper presents new sufficient conditions for convergence and asymptotic or exponential stability of a stochastic discrete-time system, under which the constructed Lyapunov function always decreases in expectation along the system's solutions after a finite number of steps, but without necessarily strict decrease at every step, in contrast to the classical stochastic Lyapunov theory. As the first application of this new Lyapunov criterion, we look at the product of any random sequence of stochastic matrices, including those with zero diagonal entries, and obtain sufficient conditions to ensure the product almost surely converges to a matrix with identical rows; we also show that the rate of convergence can be exponential under additional conditions. As the second application, we study a distributed network algorithm for solving linear algebraic equations. We relax existing conditions on the network structures, while still guaranteeing the equations are solved asymptotically.

OCMay 17, 2013
Asynchronous Decentralized Event-triggered Control

Manuel Mazo, Ming Cao

In this paper we propose an approach to the implementation of controllers with decentralized strategies triggering controller updates. We consider set-ups with a central node in charge of the computation of the control commands, and a set of not co-located sensors providing measurements to the controller node. The solution we propose does not require measurements from the sensors to be synchronized in time. The sensors in our proposal provide measurements in an aperiodic way triggered by local conditions. Furthermore, in the proposed implementation (most of) the communication between nodes requires only the exchange of one bit of information (per controller update), which could aid in reducing transmission delays and as a secondary effect result in fewer transmissions being triggered.

SYJan 6, 2018
Distributed Event-Triggered Control for Asymptotic Synchronization of Dynamical Networks

Tao Liu, Ming Cao, Claudio De Persis et al.

This paper studies synchronization of dynamical networks with event-based communication. Firstly, two estimators are introduced into each node, one to estimate its own state, and the other to estimate the average state of its neighbours. Then, with these two estimators, a distributed event-triggering rule (ETR) with a dwell time is designed such that the network achieves synchronization asymptotically with no Zeno behaviours. The designed ETR only depends on the information that each node can obtain, and thus can be implemented in a decentralized way.

SYApr 23, 2017
Opinion evolution in time-varying social influence networks with prejudiced agents

Anton V. Proskurnikov, Roberto Tempo, Ming Cao et al.

Investigation of social influence dynamics requires mathematical models that are "simple" enough to admit rigorous analysis, and yet sufficiently "rich" to capture salient features of social groups. Thus, the mechanism of iterative opinion pooling from (DeGroot, 1974), which can explain the generation of consensus, was elaborated in (Friedkin and Johnsen, 1999) to take into account individuals' ongoing attachments to their initial opinions, or prejudices. The "anchorage" of individuals to their prejudices may disable reaching consensus and cause disagreement in a social influence network. Further elaboration of this model may be achieved by relaxing its restrictive assumption of a time-invariant influence network. During opinion dynamics on an issue, arcs of interpersonal influence may be added or subtracted from the network, and the influence weights assigned by an individual to his/her neighbors may alter. In this paper, we establish new important properties of the (Friedkin and Johnsen, 1999) opinion formation model, and also examine its extension to time-varying social influence networks.

SYAug 13, 2018
Quantization effects and convergence properties of rigid formation control systems with quantized distance measurements

Zhiyong Sun, Hector Garcia de Marina, Brian D. O. Anderson et al.

In this paper, we discuss quantization effects in rigid formation control systems when target formations are described by inter-agent distances. Because of practical sensing and measurement constraints, we consider in this paper distance measurements in their quantized forms. We show that under gradient-based formation control, in the case of uniform quantization, the distance errors converge locally to a bounded set whose size depends on the quantization error, while in the case of logarithmic quantization, all distance errors converge locally to zero. A special quantizer involving the signum function is then considered with which all agents can only measure coarse distances in terms of binary information. In this case, the formation converges locally to a target formation within a finite time. Lastly, we discuss the effect of asymmetric uniform quantization on rigid formation control.

SYDec 20, 2016
Synchronization of Pulse-Coupled Oscillators and Clocks under Minimal Connectivity Assumptions

Anton V. Proskurnikov, Ming Cao

Populations of flashing fireflies, claps of applauding audience, cells of cardiac and circadian pacemakers reach synchrony via event-triggered interactions, referred to as pulse couplings. Synchronization via pulse coupling is widely used in wireless sensor networks, providing clock synchronization with parsimonious packet exchanges. In spite of serious attention paid to networks of pulse coupled oscillators, there is a lack of mathematical results, addressing networks with general communication topologies and general phase-response curves of the oscillators. The most general results of this type (Wang et al., 2012, 2015) establish synchronization of oscillators with a delay-advance phase-response curve over strongly connected networks. In this paper we extend this result by relaxing the connectivity condition to the existence of a root node (or a directed spanning tree) in the graph. This condition is also necessary for synchronization.

SYJan 10, 2016
Synchronization of Goodwin's oscillators under boundedness and nonnegativeness constraints for solutions

Anton V. Proskurnikov, Ming Cao

In the recent paper by Hamadeh et al. (2012) an elegant analytic criterion for incremental output feedback passivity (iOFP) of cyclic feedback systems (CFS) has been reported, assuming that the constituent subsystems are incrementally output strictly passive (iOSP). This criterion was used to prove that a network of identical CFS can be synchronized under sufficiently strong linear diffusive coupling. A very important class of CFS consists of biological oscillators, named after Brian Goodwin and describing self-regulated chains of enzymatic reactions, where the product of each reaction catalyzes the next reaction, while the last product inhibits the first reaction in the chain. Goodwin's oscillators are used, in particular, to model the dynamics of genetic circadian pacemakers, hormonal cycles and some metabolic pathways. In this paper we point out that for Goodwin's oscillators, where the individual reactions have nonlinear (e.g. Mikhaelis-Menten) kinetics, the synchronization criterion, obtained by Hamadeh et al., cannot be directly applied. This criterion relies on the implicit assumption of the solution boundedness, dictated also by the chemical feasibility (the state variables stand for the concentrations of chemicals). Furthermore, to test the synchronization condition one needs to know an explicit bound for a solution, which generally cannot be guaranteed under linear coupling. At the same time, we show that these restrictions can be avoided for a nonlinear synchronization protocol, where the control inputs are "saturated" by a special nonlinear function (belonging to a wide class), which guarantees nonnegativity of the solutions and allows to get explicit ultimate bounds for them. We prove that oscillators synchronize under such a protocol, provided that the couplings are sufficiently strong.

SYMar 20, 2017
Modulus consensus in discrete-time signed networks and properties of special recurrent inequalities

Anton V. Proskurnikov, Ming Cao

Recently the dynamics of signed networks, where the ties among the agents can be both positive (attractive) or negative (repulsive) have attracted substantial attention of the research community. Examples of such networks are models of opinion dynamics over signed graphs, recently introduced by Altafini (2012,2013) and extended to discrete-time case by Meng et al. (2014). It has been shown that under mild connectivity assumptions these protocols provide the convergence of opinions in absolute value, whereas their signs may differ. This "modulus consensus" may correspond to the polarization of the opinions (or bipartite consensus, including the usual consensus as a special case), or their convergence to zero. In this paper, we demonstrate that the phenomenon of modulus consensus in the discrete-time Altafini model is a manifestation of a more general and profound fact, regarding the solutions of a special recurrent inequality. Although such a recurrent inequality does not provide the uniqueness of a solution, it can be shown that, under some natural assumptions, each of its bounded solutions has a limit and, moreover, converges to consensus. A similar property has previously been established for special continuous-time differential inequalities (Proskurnikov, Cao, 2016). Besides analysis of signed networks, we link the consensus properties of recurrent inequalities to the convergence analysis of distributed optimization algorithms and the problems of Schur stability of substochastic matrices.

SYMar 23, 2017
Distributed algorithm for controlling scale-free polygonal formations

Hector Garcia de Marina, Bayu Jayawardhana, Ming Cao

This paper presents a distributed algorithm for controlling the deployment of a team of mobile agents in formations whose shapes can be characterized by a broad class of polygons, including regular ones, where each agent occupies a corner of the polygon. The algorithm shares the appealing properties of the popular distance-based rigid formation control, but with the additional advantage of requiring the control of fewer pairs of neighboring agents. Furthermore, the scale of the polygon can be controlled by only one pair of neighboring agents. We also exploit the exponential stability of the controlled formation in order to steer the formation as a whole with translations and rotations in a prescribed way. We provide both theoretical analysis and illustrative simulations.

SYOct 13, 2017
Local and global analysis of endocrine regulation as a non-cyclic feedback system

Hadi Taghvafard, Anton V. Proskurnikov, Ming Cao

To understand the sophisticated control mechanisms of the human's endocrine system is a challenging task that is a crucial step towards precise medical treatment of many disfunctions and diseases. Although mathematical models describing the endocrine system as a whole are still elusive, recently some substantial progress has been made in analyzing theoretically its subsystems (or axes) that regulate production of specific hormones. Many of the relevant mathematical models are similar in structure to (or squarely based on) the celebrated Goodwin's oscillator. Such models are convenient to explain stable periodic oscillations at hormones' level by representing the corresponding endocrine regulation circuits as cyclic feedback systems. However, many real hormonal regulation mechanisms (in particular, testosterone regulation) are in fact known to have non-cyclic structures and involve multiple feedbacks; a Goodwin-type model thus represents only a part of such a complicated mechanism. In this paper, we examine a new mathematical model of hormonal regulation, obtained from the classical Goodwin's oscillator by introducing an additional negative feedback. Local stability properties of the proposed model are studied, and we show that the local instability of its unique equilibrium implies oscillatory behavior of almost all solutions. Furthermore, under additional restrictions we prove that almost all solutions converge to periodic ones.

OCNov 11, 2018
Towards time-varying proximal dynamics in Multi-Agent Network Games

Carlo Cenedese, Yu Kawano, Sergio Grammatico et al.

Distributed decision making in multi-agent networks has recently attracted significant research attention thanks to its wide applicability, e.g. in the management and optimization of computer networks, power systems, robotic teams, sensor networks and consumer markets. Distributed decision-making problems can be modeled as inter-dependent optimization problems, i.e., multi-agent game-equilibrium seeking problems, where noncooperative agents seek an equilibrium by communicating over a network. To achieve a network equilibrium, the agents may decide to update their decision variables via proximal dynamics, driven by the decision variables of the neighboring agents. In this paper, we provide an operator-theoretic characterization of convergence with a time-invariant communication network. For the time-varying case, we consider adjacency matrices that may switch subject to a dwell time. We illustrate our investigations using a distributed robotic exploration example.

SYJul 14, 2020
Optimal Universal Controllers for Roll Stabilization

Yuri A. Kapitanyuk, Anton V. Proskurnikov, Ming Cao

Roll stabilization is an important problem of ship motion control. This problem becomes especially difficult if the same set of actuators (e.g. a single rudder) has to be used for roll stabilization and heading control of the vessel, so that the roll stabilizing system interferes with the ship autopilot. Finding the "trade-off" between the concurrent goals of accurate vessel steering and roll stabilization usually reduces to an optimization problem, which has to be solved in presence of an unknown wave disturbance. Standard approaches to this problem (loop-shaping, LQG, $H_{\infty}$-control etc.) require to know the spectral density of the disturbance, considered to be a \colored noise". In this paper, we propose a novel approach to optimal roll stabilization, approximating the disturbance by a polyharmonic signal with known frequencies yet uncertain amplitudes and phase shifts. Linear quadratic optimization problems in presence of polyharmonic disturbances can be solved by means of the theory of universal controllers developed by V.A. Yakubovich. An optimal universal controller delivers the optimal solution for any uncertain amplitudes and phases. Using Marine Systems Simulator (MSS) Toolbox that provides a realistic vessel's model, we compare our design method with classical approaches to optimal roll stabilization. Among three controllers providing the same quality of yaw steering, OUC stabilizes the roll motion most efficiently.

ROOct 11, 2023
Co-NavGPT: Multi-Robot Cooperative Visual Semantic Navigation Using Vision Language Models

Bangguo Yu, Qihao Yuan, Kailai Li et al.

Visual target navigation is a critical capability for autonomous robots operating in unknown environments, particularly in human-robot interaction scenarios. While classical and learning-based methods have shown promise, most existing approaches lack common-sense reasoning and are typically designed for single-robot settings, leading to reduced efficiency and robustness in complex environments. To address these limitations, we introduce Co-NavGPT, a novel framework that integrates a Vision Language Model (VLM) as a global planner to enable common-sense multi-robot visual target navigation. Co-NavGPT aggregates sub-maps from multiple robots with diverse viewpoints into a unified global map, encoding robot states and frontier regions. The VLM uses this information to assign frontiers across the robots, facilitating coordinated and efficient exploration. Experiments on the Habitat-Matterport 3D (HM3D) demonstrate that Co-NavGPT outperforms existing baselines in terms of success rate and navigation efficiency, without requiring task-specific training. Ablation studies further confirm the importance of semantic priors from the VLM. We also validate the framework in real-world scenarios using quadrupedal robots. Supplementary video and code are available at: https://sites.google.com/view/co-navgpt2.

11.9SYApr 8
Markov Chains and Random Walks with Memory on Hypergraphs: A Tensor-Based Approach

Shaoxuan Cui, Lingfei Wang, Hildeberto Jardon-Kojakhmetov et al.

Many complex systems exhibit interactions that depend not only on pairwise connections, but also group structures and memory effects. To capture such effects, we develop a unified tensor framework for modeling higher-order Markov chains with memory. Our formulation introduces an even-order paired tensor that links folded and unfolded dynamics and characterizes their steady states and convergence. We further show that a Markov chain with memory can be approximated by a low-dimensional nonlinear tensor-based system and then provide a full system analysis. As an application, we define random walks on hypergraphs where memory naturally arises from the hyperedge structure, providing new tools for analyzing higher-order networks with time-dependent effects.

ROMar 31, 2025Code
GenSwarm: Scalable Multi-Robot Code-Policy Generation and Deployment via Language Models

Wenkang Ji, Huaben Chen, Mingyang Chen et al.

The development of control policies for multi-robot systems traditionally follows a complex and labor-intensive process, often lacking the flexibility to adapt to dynamic tasks. This has motivated research on methods to automatically create control policies. However, these methods require iterative processes of manually crafting and refining objective functions, thereby prolonging the development cycle. This work introduces \textit{GenSwarm}, an end-to-end system that leverages large language models to automatically generate and deploy control policies for multi-robot tasks based on simple user instructions in natural language. As a multi-language-agent system, GenSwarm achieves zero-shot learning, enabling rapid adaptation to altered or unseen tasks. The white-box nature of the code policies ensures strong reproducibility and interpretability. With its scalable software and hardware architectures, GenSwarm supports efficient policy deployment on both simulated and real-world multi-robot systems, realizing an instruction-to-execution end-to-end functionality that could prove valuable for robotics specialists and non-specialists alike.The code of the proposed GenSwarm system is available online: https://github.com/WindyLab/GenSwarm.

29.8SYApr 9
Linear Feedback Controller for Homogeneous Polynomial Systems

Shaoxuan Cui, Qi Zhao, Guanlin Li et al.

This paper studies stabilization and its corresponding closed-loop region-of-attraction (ROA) for homogeneous polynomial dynamical systems whose nonlinear term admits an orthogonally decomposable (ODECO) tensor representation. While recent tensor-based results provide explicit solutions and sharp global characterizations for open-loop ODECO systems, closed-loop synthesis and computable ROA estimates are still often dominated by local linearization or Lyapunov/SOS (sum of squares) methods, which can be conservative and computationally demanding. We propose a structure-preserving linear feedback design that shares the ODECO eigenbasis of the system's tensor, thereby enabling closed-form trajectory expressions, explicit convergence/escape thresholds, and sharp ROA characterizations. Under mild conditions, we further derive robustness/ISS-type bounds for bounded disturbances. Numerical examples validate the theoretical results.

ROMar 24, 2021Code
Pole-like Objects Mapping and Long-Term Robot Localization in Dynamic Urban Scenarios

Zhihao Wang, Silin Li, Ming Cao et al.

Localization on 3D data is a challenging task for unmanned vehicles, especially in long-term dynamic urban scenarios. Due to the generality and long-term stability, the pole-like objects are very suitable as landmarks for unmanned vehicle localization in time-varing scenarios. In this paper, a long-term LiDAR-only localization algorithm based on semantic cluster map is proposed. At first, the Convolutional Neural Network(CNN) is used to infer the semantics of LiDAR point clouds. Combined with the point cloud segmentation, the long-term static objects pole/trunk in the scene are extracted and registered into a semantic cluster map. When the unmanned vehicle re-enters the environment again, the relocalization is completed by matching the clusters of the local map with the clusters of the global map. Furthermore, the continuous matching between the local and global maps stably outputs the global pose at 2Hz to correct the drift of the 3D LiDAR odometry. The proposed approach realizes localization in the long-term scenarios without maintaining the high-precision point cloud map. The experimental results on our campus dataset demonstrate that the proposed approach performs better in localization accuracy compared with the current state-of-the-art methods. The source of this paper is available at: http://www.github.com/HITSZ-NRSL/long-term-localization.

68.3DSApr 10
Modelling the coevolution of opinion dynamics and decision making in social dilemmas

Ella C. Davidson, Lorenzo Zino, Ming Cao et al.

This paper proposes a mathematical model for the coevolution of actions and opinions for a population facing a social dilemma. In particular, we assume each person participates in a Public Goods Game (PGG), with their action being to cooperate or defect, and holds an opinion about which action they prefer. We propose a payoff function that combines the PGG with the Friedkin--Johnsen model from opinion dynamics to form a coevolutionary game. According to a discrete-time process, players asynchronously update their actions and opinions, aiming to maximise their individual payoff for the coevolutionary game using myopic best-response. We study the equilibria and provide conditions for the existence of the all-defection and all-cooperation consensus equilibria. We also establish conditions for global convergence to the all-defection equilibrium.

26.1GTMay 8
Zero-determinant Strategy for Moving Target Defense: Existence, Performance, and Computation

Zhaoyang Cheng, Guanpu Chen, Yiguang Hong et al.

Moving Target Defense (MTD) is commonly formulated as a repeated security game to mitigate persistent threats. Although the strong Stackelberg equilibrium (SSE) characterizes the defender's optimal strategy in the leader-follower framework, computing the SSE often incurs high computational complexity, which significantly limits its practical deployment in MTD problems with multiple targets. This paper proposes adopting a zero-determinant (ZD) strategy for constructing an MTD strategy that achieves both high defensive performance and substantially low computational complexity. We first derive a necessary and sufficient condition for the existence of ZD strategies and investigate the performance of ZD strategies, which shows their upper-bound performance matches that of the SSE strategy. We then formulate two programs to find the optimal ZD strategy parameters under different conditions. Moreover, we design an algorithm to compute the proposed ZD strategies, along with the computational complexity analysis in comparison with the traditional SSE computation. Finally, we conduct experiments on two practical applications to verify our results.

7.5SYMay 4
Awareness in collective decision-making: Modeling and control in a game-theoretic framework

Mengbin Ye, Lorenzo Zino, Ming Cao

For a society to remain healthy and prosperous, people must collectively behave and act to contribute to the common good, even if there is often a tradeoff against their individual benefit. Paradigmatic examples include the adoption of sustainable behaviors and technologies to combat the climate crisis, and the mobilization for collective action to promote the rights and freedoms of repressed minorities. In this tutorial, we illustrate how game theory and network systems theory can be powerful tools to model and study this collective decision-making problem. We provide examples of how awareness of this tradeoff can impact collective change toward the societal good, exploring different problem contexts such as sustainable behavior and collective action. Finally, we review recent developments using systems and control-theoretic approaches to generate awareness and guide the emergent population dynamics towards a desired outcome, and conclude by highlighting new research and application frontiers.

ROJan 13
Fairness risk and its privacy-enabled solution in AI-driven robotic applications

Le Liu, Bangguo Yu, Nynke Vellinga et al.

Complex decision-making by autonomous machines and algorithms could underpin the foundations of future society. Generative AI is emerging as a powerful engine for such transitions. However, we show that Generative AI-driven developments pose a critical pitfall: fairness concerns. In robotic applications, although intuitions about fairness are common, a precise and implementable definition that captures user utility and inherent data randomness is missing. Here we provide a utility-aware fairness metric for robotic decision making and analyze fairness jointly with user-data privacy, deriving conditions under which privacy budgets govern fairness metrics. This yields a unified framework that formalizes and quantifies fairness and its interplay with privacy, which is tested in a robot navigation task. In view of the fact that under legal requirements, most robotic systems will enforce user privacy, the approach shows surprisingly that such privacy budgets can be jointly used to meet fairness targets. Addressing fairness concerns in the creative combined consideration of privacy is a step towards ethical use of AI and strengthens trust in autonomous robots deployed in everyday environments.

LGMay 13, 2025
High-order Regularization for Machine Learning and Learning-based Control

Xinghua Liu, Ming Cao

The paper proposes a novel regularization procedure for machine learning. The proposed high-order regularization (HR) provides new insight into regularization, which is widely used to train a neural network that can be utilized to approximate the action-value function in general reinforcement learning problems. The proposed HR method ensures the provable convergence of the approximation algorithm, which makes the much-needed connection between regularization and explainable learning using neural networks. The proposed HR method theoretically demonstrates that regularization can be regarded as an approximation in terms of inverse mapping with explicitly calculable approximation error, and the $L_2$ regularization is a lower-order case of the proposed method. We provide lower and upper bounds for the error of the proposed HR solution, which helps build a reliable model. We also find that regularization with the proposed HR can be regarded as a contraction. We prove that the generalizability of neural networks can be maximized with a proper regularization matrix, and the proposed HR is applicable for neural networks with any mapping matrix. With the theoretical explanation of the extreme learning machine for neural network training and the proposed high-order regularization, one can better interpret the output of the neural network, thus leading to explainable learning. We present a case study based on regularized extreme learning neural networks to demonstrate the application of the proposed HR and give the corresponding incremental HR solution. We verify the performance of the proposed HR method by solving a classic control problem in reinforcement learning. The result demonstrates the superior performance of the method with significant enhancement in the generalizability of the neural network.

OCMay 16, 2021
Robust optimal policies for team Markov games

Feng Huang, Ming Cao, Long Wang

In stochastic dynamic environments, team Markov games have emerged as a versatile paradigm for studying sequential decision-making problems of fully cooperative multi-agent systems. However, the optimality of the derived policies is usually sensitive to model parameters, which are typically unknown and required to be estimated from noisy data in practice. To mitigate the sensitivity of optimal policies to these uncertain parameters, we propose a robust model of team Markov games in this paper, where agents utilize robust optimization approaches to update strategies. This model extends team Markov games to the scenario of incomplete information and meanwhile provides an alternative solution concept of robust team optimality. To seek such a solution, we develop a robust iterative learning algorithm of team policies and prove its convergence. This algorithm, compared with robust dynamic programming, not only possesses a faster convergence rate, but also allows for using approximation calculations to alleviate the curse of dimensionality. Moreover, some numerical simulations are presented to demonstrate the effectiveness of the algorithm by generalizing the game model of sequential social dilemmas to uncertain scenarios.

ROMar 23, 2021
Distributed coordinated path following using guiding vector fields

Weijia Yao, Hector Garcia de Marina, Zhiyong Sun et al.

It is essential in many applications to impose a scalable coordinated motion control on a large group of mobile robots, which is efficient in tasks requiring repetitive execution, such as environmental monitoring. In this paper, we design a guiding vector field to guide multiple robots to follow possibly different desired paths while coordinating their motions. The vector field uses a path parameter as a virtual coordinate that is communicated among neighboring robots. Then, the virtual coordinate is utilized to control the relative parametric displacement between robots along the paths. This enables us to design a saturated control algorithm for a Dubins-car-like model. The algorithm is distributed, scalable, and applicable for any smooth paths in an $n$-dimensional configuration space, and global convergence is guaranteed. Simulations with up to fifty robots and outdoor experiments with fixed-wing aircraft validate the theoretical results.

RODec 3, 2020
Singularity-free Guiding Vector Field for Robot Navigation

Weijia Yao, Hector Garcia de Marina, Bohuan Lin et al.

Most of the existing path-following navigation algorithms cannot guarantee global convergence to desired paths or enable following self-intersected desired paths due to the existence of singular points where navigation algorithms return unreliable or even no solutions. One typical example arises in vector-field guided path-following (VF-PF) navigation algorithms. These algorithms are based on a vector field, and the singular points are exactly where the vector field diminishes. In this paper, we show that it is mathematically impossible for conventional VF-PF algorithms to achieve global convergence to desired paths that are self-intersected or even just simple closed (precisely, homeomorphic to the unit circle). Motivated by this new impossibility result, we propose a novel method to transform self-intersected or simple closed desired paths to non-self-intersected and unbounded (precisely, homeomorphic to the real line) counterparts in a higher-dimensional space. Corresponding to this new desired path, we construct a singularity-free guiding vector field on a higher-dimensional space. The integral curves of this new guiding vector field is thus exploited to enable global convergence to the higher-dimensional desired path, and therefore the projection of the integral curves on a lower-dimensional subspace converge to the physical (lower-dimensional) desired path. Rigorous theoretical analysis is carried out for the theoretical results using dynamical systems theory. In addition, we show both by theoretical analysis and numerical simulations that our proposed method is an extension combining conventional VF-PF algorithms and trajectory tracking algorithms. Finally, to show the practical value of our proposed approach for complex engineering systems, we conduct outdoor experiments with a fixed-wing airplane in windy environment to follow both 2D and 3D desired paths.

SYJun 29, 2019
Distributed Global Output-Feedback Control for a Class of Euler-Lagrange Systems

Qingkai Yang, Hao Fang, Jie Chen et al.

This published paper investigates the distributed tracking control problem for a class of Euler-Lagrange multi-agent systems when the agents can only measure the positions. In this case, the lack of the separation principle and the strong nonlinearity in unmeasurable states pose severe technical challenges to global output-feedback control design. To overcome these difficulties, a global nonsingular coordinate transformation matrix in the upper triangular form is firstly proposed such that the nonlinear dynamic model can be partially linearized with respect to the unmeasurable states. And, a new type of velocity observers is designed to estimate the unmeasurable velocities for each system. Then, based on the outputs of the velocity observers, we propose distributed control laws that enable the coordinated tracking control system to achieve uniform global exponential stability (UGES). Both theoretical analysis and numerical simulations are presented to validate the effectiveness of the proposed control scheme. Followed by the original paper, a typo and a mistake is corrected.

SYMay 1, 2019
Asynchronous Distributed Power Control of Multi-Microgrid Systems Based on the Operator Splitting Approach

Zhaojian Wang, Shengwei Mei, Feng Liu et al.

Forming (hybrid) AC/DC microgrids (MGs) has become a promising manner for the interconnection of various kinds of distributed generators that are inherently AC or DC electric sources. This paper addresses the distributed asynchronous power control problem of hybrid microgrids, considering imperfect communication due to non-identical sampling rates and communication delays. To this end, we first formulate the optimal power control problem of MGs and devise a synchronous algorithm. Then, we analyze the impact of asynchrony on optimal power control and propose an asynchronous iteration algorithm based on the synchronous version. By introducing a random clock at each iteration, different types of asynchrony are fitted into a unified framework, where the asynchronous algorithm is converted into a fixed-point problem based on the operator splitting method, leading to a convergence proof. We further provide an upper bound estimation of the time delay in the communication. Moreover, the real-time implementation of the proposed algorithm in both AC and DC MGs is introduced. By taking the power system as a solver, the controller is simplified by reducing one order and the power loss can be considered. Finally, a benchmark MG is utilized to verify the effectiveness and advantages of the proposed algorithm.

ROSep 20, 2018
Multi-robot motion-formation distributed control with sensor self-calibration: experimental validation

Hector Garcia de Marina, Johan Siemonsma, Bayu Jayawardhana et al.

In this paper, we present the design and implementation of a robust motion formation distributed control algorithm for a team of mobile robots. The primary task for the team is to form a geometric shape, which can be freely translated and rotated at the same time. This approach makes the robots to behave as a cohesive whole, which can be useful in tasks such as collaborative transportation. The robustness of the algorithm relies on the fact that each robot employs only local measurements from a laser sensor which does not need to be off-line calibrated. Furthermore, robots do not need to exchange any information with each other. Being free of sensor calibration and not requiring a communication channel helps the scaling of the overall system to a large number of robots. In addition, since the robots do not need any off-board localization system, but require only relative positions with respect to their neighbors, it can be aimed to have a full autonomous team that operates in environments where such localization systems are not available. The computational cost of the algorithm is inexpensive and the resources from a standard microcontroller will suffice. This fact makes the usage of our approach appealing as a support for other more demanding algorithms, e.g., processing images from onboard cameras. We validate the performance of the algorithm with a team of four mobile robots equipped with low-cost commercially available laser scanners.

SYApr 3, 2017
Controlling a triangular flexible formation of autonomous agents

Hector Garcia de Marina, Zhiyong Sun, Ming Cao et al.

In formation control, triangular formations consisting of three autonomous agents serve as a class of benchmarks that can be used to test and compare the performances of different controllers. We present an algorithm that combines the advantages of both position- and distance-based gradient descent control laws. For example, only two pairs of neighboring agents need to be controlled, agents can work in their own local frame of coordinates and the orientation of the formation with respect to a global frame of coordinates is not prescribed. We first present a novel technique based on adding artificial biases to neighboring agents' range sensors such that their eventual positions correspond to a collinear configuration. Right after, a small modification in the bias terms by introducing a prescribed rotation matrix will allow the control of the bearing of the neighboring agents.

SYJun 29, 2017
Differential Inequalities in Multi-Agent Coordination and Opinion Dynamics Modeling

Anton V. Proskurnikov, Ming Cao

Distributed algorithms of multi-agent coordination have attracted substantial attention from the research community; the simplest and most thoroughly studied of them are consensus protocols in the form of differential or difference equations over general time-varying weighted graphs. These graphs are usually characterized algebraically by their associated Laplacian matrices. Network algorithms with similar algebraic graph theoretic structures, called being of Laplacian-type in this paper, also arise in other related multi-agent control problems, such as aggregation and containment control, target surrounding, distributed optimization and modeling of opinion evolution in social groups. In spite of their similarities, each of such algorithms has often been studied using separate mathematical techniques. In this paper, a novel approach is offered, allowing a unified and elegant way to examine many Laplacian-type algorithms for multi-agent coordination. This approach is based on the analysis of some differential or difference inequalities that have to be satisfied by the some "outputs" of the agents (e.g. the distances to the desired set in aggregation problems). Although such inequalities may have many unbounded solutions, under natural graphic connectivity conditions all their bounded solutions converge (and even reach consensus), entailing the convergence of the corresponding distributed algorithms. In the theory of differential equations the absence of bounded non-convergent solutions is referred to as the equation's dichotomy. In this paper, we establish the dichotomy criteria of Laplacian-type differential and difference inequalities and show that these criteria enable one to extend a number of recent results, concerned with Laplacian-type algorithms for multi-agent coordination and modeling opinion formation in social groups.

SYJun 4, 2017
Persistent Flows and Non-Reciprocal Interactions in Deterministic Networks

Weiguo Xia, Guodong Shi, Ziyang Meng et al.

This paper studies deterministic consensus networks with discrete-time dynamics under persistent flows and non-reciprocal agent interactions. An arc describing the interaction strength between two agents is said to be persistent if its weight function has an infinite $l_1$ norm. We discuss two balance conditions on the interactions between agents which generalize the arc-balance and cut-balance conditions in the literature respectively. The proposed conditions require that such a balance should be satisfied over each time window of a fixed length instead of at each time instant. We prove that in both cases global consensus is reached if and only if the persistent graph, which consists of all the persistent arcs, contains a directed spanning tree. The convergence rates of the system to consensus are also provided in terms of the interactions between agents having taken place. The results are obtained under a weak condition without assuming the existence of a positive lower bound of all the nonzero weights of arcs and are compared with the existing results. Illustrative examples are provided to show the critical importance of the nontrivial lower boundedness of the self-confidence of the agents.

OCFeb 15, 2017
Target assignment for robots constrained by limited communication range

Xiaoshan Bai, Weisheng Yan, Ming Cao et al.

This paper investigates the task assignment problem for multiple dispersed robots constrained by limited communication range. The robots are initially randomly distributed and need to visit several target locations while minimizing the total travel time. A centralized rendezvous-based algorithm is proposed, under which all the robots first move towards a rendezvous position until communication paths are established between every pair of robots either directly or through intermediate peers, and then one robot is chosen as the leader to make a centralized task assignment for the other robots. Furthermore, we propose a decentralized algorithm based on a single-traveling-salesman tour, which does not require all the robots to be connected through communication. We investigate the variation of the quality of the assignment solutions as the level of information sharing increases and as the communication range grows, respectively. The proposed algorithms are compared with a centralized algorithm with shared global information and a decentralized greedy algorithm respectively. Monte Carlo simulation results show the satisfying performance of the proposed algorithms.

SYOct 14, 2016
A guiding vector field algorithm for path following control of nonholonomic mobile robots

Yuri A. Kapitanyuk, Anton V. Proskurnikov, Ming Cao

In this paper we propose an algorithm for path following control of the nonholonomic mobile robot based on the idea of the guiding vector field (GVF). The desired path may be an arbitrary smooth curve in its implicit form, that is, a level set of a predefined smooth function. Using this function and the robot's kinematic model, we design a GVF, whose integral curves converge to the trajectory. A nonlinear motion controller is then proposed which steers the robot along such an integral curve, bringing it to the desired path. We establish global convergence conditions for our algorithm and demonstrate its applicability and performance by experiments with real wheeled robots.

ROOct 10, 2016
Guidance algorithm for smooth trajectory tracking of a fixed wing UAV flying in wind flows

Hector Garcia de Marina, Yuri A. Kapitanyuk, Murat Bronz et al.

This paper presents an algorithm for solving the problem of tracking smooth curves by a fixed wing unmanned aerial vehicle travelling with a constant airspeed and under a constant wind disturbance. The algorithm is based on the idea of following a guiding vector field which is constructed from the implicit function that describes the desired (possibly time-varying) trajectory. The output of the algorithm can be directly expressed in terms of the bank angle of the UAV in order to achieve coordinated turns. Furthermore, the algorithm can be tuned offline such that physical constraints of the UAV, e.g. the maximum bank angle, will not be violated in a neighborhood of the desired trajectory. We provide the corresponding theoretical convergence analysis and performance results from actual flights.

SYSep 23, 2016
Distributed scaling control of rigid formations

Hector Garcia de Marina, Bayu Jayawardhana, Ming Cao

Recently it has been reported that biased range-measurements among neighboring agents in the gradient distance-based formation control can lead to predictable collective motion. In this paper we take advantage of this effect and by introducing distributed parameters to the prescribed inter-distances we are able to manipulate the steady-state motion of the formation. This manipulation is in the form of inducing simultaneously the combination of constant translational and angular velocities and a controlled scaling of the rigid formation. While the computation of the distributed parameters for the translational and angular velocities is based on the well-known graph rigidity theory, the parameters responsible for the scaling are based on some recent findings in bearing rigidity theory. We carry out the stability analysis of the modified gradient system and simulations in order to validate the main result.

ROSep 21, 2016
Controlling rigid formations of mobile agents under inconsistent measurements

Hector Garcia de Marina, Ming Cao, Bayu Jayawardhana

Despite the great success of using gradient-based controllers to stabilize rigid formations of autonomous agents in the past years, surprising yet intriguing undesirable collective motions have been reported recently when inconsistent measurements are used in the agents' local controllers. To make the existing gradient control robust against such measurement inconsistency, we exploit local estimators following the well known internal model principle for robust output regulation control. The new estimator-based gradient control is still distributed in nature and can be constructed systematically even when the number of agents in a rigid formation grows. We prove rigorously that the proposed control is able to guarantee exponential convergence and then demonstrate through robotic experiments and computer simulations that the reported inconsistency-induced orbits of collective movements are effectively eliminated.

ROApr 26, 2016
Distributed rotational and translational maneuvering of rigid formations and their applications

Hector Garcia de Marina, Bayu Jayawardhana, Ming Cao

Recently it has been reported that range-measurement inconsistency, or equivalently mismatches in prescribed inter-agent distances, may prevent the popular gradient controllers from guiding rigid formations of mobile agents to converge to their desired shape, and even worse from standing still at any location. In this paper, instead of treating mismatches as the source of ill performance, we take them as design parameters and show that by introducing such a pair of parameters per distance constraint, distributed controller achieving simultaneously both formation and motion control can be designed that not only encompasses the popular gradient control, but more importantly allows us to achieve constant collective translation, rotation or their combination while guaranteeing asymptotically no distortion in the formation shape occurs. Such motion control results are then applied to (a) the alignment of formations orientations and (b) enclosing and tracking a moving target. Besides rigorous mathematical proof, experiments using mobile robots are demonstrated to show the satisfying performances of the proposed formation-motion distributed controller.

OCApr 11, 2016
Taming mismatches in inter-agent distances for the formation-motion control of second-order agents

Hector Garcia de Marina, Bayu Jayawardhana, Ming Cao

This paper presents the analysis on the influence of distance mismatches on the standard gradient-based rigid formation control for second-order agents. It is shown that, similar to the first-order case as recently discussed in the literature, these mismatches introduce two undesired group behaviors: a distorted final shape and a steady-state motion of the group formation. We show that such undesired behaviors can be eliminated by combining the standard formation control law with distributed estimators. Finally, we show how the mismatches can be effectively employed as design parameters in order to control a combined translational and rotational motion of the formation.

SYAug 20, 2015
Opinion Dynamics in Social Networks with Hostile Camps: Consensus vs. Polarization

Anton V. Proskurnikov, Alexey Matveev, Ming Cao

Most of the distributed protocols for multi-agent consensus assume that the agents are mutually cooperative and "trustful," and so the couplings among the agents bring the values of their states closer. Opinion dynamics in social groups, however, require beyond these conventional models due to ubiquitous competition and distrust between some pairs of agents, which are usually characterized by repulsive couplings and may lead to clustering of the opinions. A simple yet insightful model of opinion dynamics with both attractive and repulsive couplings was proposed recently by C. Altafini, who examined first-order consensus algorithms over static signed graphs. This protocol establishes modulus consensus, where the opinions become the same in modulus but may differ in signs. In this paper, we extend the modulus consensus model to the case where the network topology is an arbitrary time-varying signed graph and prove reaching modulus consensus under mild sufficient conditions of uniform connectivity of the graph. For cut-balanced graphs, not only sufficient, but also necessary conditions for modulus consensus are given.