Murat Arcak

SY
h-index60
48papers
115citations
Novelty51%
AI Score53

48 Papers

SYJun 21, 2016
Traffic Network Control from Temporal Logic Specifications

Samuel Coogan, Ebru Aydin Gol, Murat Arcak et al.

We propose a framework for generating a signal control policy for a traffic network of signalized intersections to accomplish control objectives expressible using linear temporal logic. By applying techniques from model checking and formal methods, we obtain a correct-by-construction controller that is guaranteed to satisfy complex specifications. To apply these tools, we identify and exploit structural properties particular to traffic networks that allow for efficient computation of a finite state abstraction. In particular, traffic networks exhibit a componentwise monotonicity property which allows reach set computations that scale linearly with the dimension of the continuous state space.

OCDec 29, 2016
Compositional abstraction for networks of control systems: A dissipativity approach

Majid Zamani, Murat Arcak

In this paper we propose a compositional scheme for the construction of abstractions for networks of control systems using the interconnection matrix and joint dissipativity-type properties of subsystems and their abstractions. In the proposed framework, the abstraction, itself a control system (possibly with a lower dimension), can be used as a substitution of the original system in the controller design process. Moreover, we provide a procedure for constructing abstractions of a class of nonlinear control systems by using the bounds on the slope of system nonlinearities. We illustrate the proposed results on a network of linear control systems by constructing its abstraction in a compositional way without requiring any condition on the number or gains of the subsystems. We use the abstraction as a substitute to synthesize a controller enforcing a certain linear temporal logic specification. This example particularly elucidates the effectiveness of dissipativity-type compositional reasoning for large-scale systems.

SYFeb 14, 2019
TIRA: Toolbox for Interval Reachability Analysis

Pierre-Jean Meyer, Alex Devonport, Murat Arcak

This paper presents TIRA, a Matlab library gathering several methods for the computation of interval over-approximations of the reachable sets for both continuous- and discrete-time nonlinear systems. Unlike other existing tools, the main strength of interval-based reachability analysis is its simplicity and scalability, rather than the accuracy of the over-approximations. The current implementation of TIRA contains four reachability methods covering wide classes of nonlinear systems, handled with recent results relying on contraction/growth bounds and monotonicity concepts. TIRA's architecture features a central function working as a hub between the user-defined reachability problem and the library of available reachability methods. This design choice offers increased extensibility of the library, where users can define their own method in a separate function and add the function call in the hub function.

SYNov 20, 2017
Finite Horizon Robustness Analysis of LTV Systems Using Integral Quadratic Constraints

Peter Seiler, Robert Moore, Chris Meissen et al.

The goal of this paper is to assess the robustness of an uncertain linear time-varying (LTV) system on a finite time horizon. The uncertain system is modeled as a connection of a known LTV system and a perturbation. The input/output behavior of the perturbation is described by time-domain, integral quadratic constraints (IQCs). Typical notions of robustness, e.g. nominal stability and gain/phase margins, can be insufficient for finite-horizon analysis. Instead, this paper focuses on robust induced gains and bounds on the reachable set of states. Sufficient conditions to compute robust performance bounds are formulated using dissipation inequalities and IQCs. The analysis conditions are provided in two equivalent forms as Riccati differential equations and differential linear matrix inequalities. A computational approach is provided that leverages both forms of the analysis conditions. The approach is demonstrated with two examples

SYSep 30, 2016
Mixed Monotonicity of Partial First-In-First-Out Traffic Flow Models

Samuel Coogan, Murat Arcak, Alexander A. Kurzhanskiy

In vehicle traffic networks, congestion on one outgoing link of a diverging junction often impedes flow to other outgoing links, a phenomenon known as the first-in-first-out (FIFO) property. Simplified traffic models that do not account for the FIFO property result in monotone dynamics for which powerful analysis techniques exist. FIFO models are in general not monotone, but have been shown to be mixed monotone - a generalization of monotonicity that enables similarly powerful analysis techniques. In this paper, we study traffic flow models for which the FIFO property is only partial, that is, flows at diverging junctions exhibit a combination of FIFO and non-FIFO phenomena. We show that mixed monotonicity extends to this wider class of models and establish conditions that guarantee convergence to an equilibrium.

DSJul 24, 2014
Pattern Formation with a Compartmental Lateral Inhibition System

Ana Sofia Rufino Ferreira, Justin Hsia, Murat Arcak et al.

We propose a compartmental lateral inhibition system that generates contrasting patterns of gene expression between neighboring compartments. The system consists of a set of compartments interconnected by channels. Each compartment contains a colony of cells that produce diffusible molecules to be detected by the neighboring colony, and each cell is equipped with an inhibitory circuit that reduces its production when the detected signal is stronger. We develop a technique to analyze the steady-state patterns emerging from this lateral inhibition system and apply it to a specific implementation. The analysis shows that the proposed system indeed exhibits contrasting patterns within realistic parameter ranges.

SYJun 13, 2018
Sampled-data reachability analysis using sensitivity and mixed-monotonicity

Pierre-Jean Meyer, Samuel Coogan, Murat Arcak

This paper over-approximates the reachable sets of a continuous-time uncertain system using the sensitivity of its trajectories with respect to initial conditions and uncertain parameters. We first prove the equivalence between an existing over-approximation result based on the sign-stability of the sensitivity matrices and a discrete-time approach relying on a mixed-monotonicity property. We then present a new over-approximation result which scales at worst linearly with the state dimension and is applicable to any continuous-time system with bounded sensitivity. Finally, we provide a simulation-based approach to estimate these bounds through sampling and falsification. The results are illustrated with numerical examples on traffic networks and satellite orbits.

SYApr 12, 2017
Sparsity-Sensitive Finite Abstraction

Felix Gruber, Eric S. Kim, Murat Arcak

Abstraction of a continuous-space model into a finite state and input dynamical model is a key step in formal controller synthesis tools. To date, these software tools have been limited to systems of modest size (typically $\leq$ 6 dimensions) because the abstraction procedure suffers from an exponential runtime with respect to the sum of state and input dimensions. We present a simple modification to the abstraction algorithm that dramatically reduces the computation time for systems exhibiting a sparse interconnection structure. This modified procedure recovers the same abstraction as the one computed by a brute force algorithm that disregards the sparsity. Examples highlight speed-ups from existing benchmarks in the literature, synthesis of a safety supervisory controller for a 12-dimensional and abstraction of a 51-dimensional vehicular traffic network.

DSDec 7, 2012
A Graph Partitioning Approach to Predict Patterns in Lateral Inhibition Systems

Ana S. Rufino Ferreira, Murat Arcak

We analyze pattern formation on a network of cells where each cell inhibits its neighbors through cell-to-cell contact signaling. The network is modeled as an interconnection of identical dynamical subsystems each of which represents the signaling reactions in a cell. We search for steady state patterns by partitioning the graph vertices into disjoint classes, where the cells in the same class have the same final fate. To prove the existence of steady states with this structure, we use results from monotone systems theory. Finally, we analyze the stability of these patterns with a block decomposition based on the graph partition.

SYSep 19, 2017
Simulation-based reachability analysis for nonlinear systems using componentwise contraction properties

Murat Arcak, John Maidens

A shortcoming of existing reachability approaches for nonlinear systems is the poor scalability with the number of continuous state variables. To mitigate this problem we present a simulation-based approach where we first sample a number of trajectories of the system and next establish bounds on the convergence or divergence between the samples and neighboring trajectories. We compute these bounds using contraction theory and reduce the conservatism by partitioning the state vector into several components and analyzing contraction properties separately in each direction. Among other benefits this allows us to analyze the effect of constant but uncertain parameters by treating them as state variables and partitioning them into a separate direction. We next present a numerical procedure to search for weighted norms that yield a prescribed contraction rate, which can be incorporated in the reachability algorithm to adjust the weights to minimize the growth of the reachable set.

SYNov 16, 2018
Robust Control of the Sit-to-Stand Movement for a Powered Lower Limb Orthosis

Octavio Narvaez-Aroche, Pierre-Jean Meyer, Stephen Tu et al.

The sit-to-stand movement is a key feature for wide adoption of powered lower limb orthoses for patients with complete paraplegia. In this paper we study the control of the ascending phase of the sit-to-stand movement for a minimally actuated powered lower limb orthosis at the hips. First, we generate a pool of finite horizon Linear Quadratic Regulator feedback gains, designed under the assumption that we can control not only the torque at the hips but also the loads at the shoulders that in reality are applied by the user. Next we conduct reachability analysis to define a performance metric measuring the robustness of each controller against parameter uncertainty, and choose the best controller from the pool with respect to this metric. Then, we replace the presumed shoulder control with an Iterative Learning Control algorithm as a substitute for human experiments. Indeed this algorithm obtains torque and forces at the shoulders that result in successful simulations of the sit-to-stand movement, regardless of parameter uncertainty and factors deliberately introduced to hinder learning. Thus it is reasonable to expect that the superior cognitive skills of real users will enable them to cooperate with the hip torque controller through training.

SYJan 29, 2018
Exploiting symmetry for discrete-time reachability computations

John Maidens, Murat Arcak

We present a method of computing backward reachable sets for nonlinear discrete-time control systems possessing continuous symmetries. The starting point is a dynamic game formulation of reachability analysis where control inputs aim to maintain the state variables within a target tube despite disturbances. Our method exploits symmetry to compute the reachable sets in a lower-dimensional space, enabling a significant computational speedup. To achieve this, we present a general method for symmetry reduction based on the Cartan frame, which simplifies the dynamic programming iteration without algebraic manipulation of the state update equations. We illustrate the results by computing a backward reachable set for a six-dimensional reach-avoid game of two Dubins vehicles.

SYMar 21, 2016
Semidefinite relaxations in optimal experiment design with application to substrate injection for hyperpolarized MRI

John Maidens, Murat Arcak

We consider the problem of optimal input design for estimating uncertain parameters in a discrete-time linear state space model, subject to simultaneous amplitude and l1/l2-norm constraints on the admissible inputs. We formulate this problem as the maximization of a (non-concave) quadratic function over the space of inputs, and use semidefinite relaxation techniques to efficiently find the global solution or to provide an upper bound. This investigation is motivated by a problem in medical imaging, specifically designing a substrate injection profile for in vivo metabolic parameter mapping using magnetic resonance imaging (MRI) with hyperpolarized carbon-13 pyruvate. In the l2-norm-constrained case, we show that the relaxation is tight, allowing us to efficiently compute a globally optimal injection profile. In the l1-norm-constrained case the relaxation is no longer tight, but can be used to prove that the boxcar injection currently used in practice achieves at least 98.7% of the global optimum.

SYAug 29, 2018
Symmetry reduction for dynamic programming

John Maidens, Axel Barrau, Silvere Bonnabel et al.

We present a method of exploiting symmetries of discrete-time optimal control problems to reduce the dimensionality of dynamic programming iterations. The results are derived for systems with continuous state variables, and can be applied to systems with continuous or discrete symmetry groups. We prove that symmetries of the state update equation and stage costs induce corresponding symmetries of the optimal cost function and the optimal policies. We then provide a general framework for computing the optimal cost function based on gridding a space of lower dimension than the original state space. This method does not require algebraic manipulation of the state update equations; it only requires knowledge of the symmetries that the state update equations possess. Since the method can be performed without any knowledge of the state update map beyond being able to evaluate it and verify its symmetries, this enables the method to be applied in a wide range of application problems. We illustrate these results on two six-dimensional optimal control problems that are computationally difficult to solve by dynamic programming without symmetry reduction.

SYMar 1, 2018
A Benchmark Problem in Transportation Networks

Samuel Coogan, Murat Arcak

In this note, we propose a case study of freeway traffic flow modeled as a hybrid system. We describe two general classes of networks that model flow along a freeway with merging onramps. The admission rate of traffic flow from each onramp is metered via a control input. Both classes of networks are easily scaled to accommodate arbitrary state dimension. The model is discrete-time and possesses piecewise-affine dynamics. Moreover, we present several control objectives that are especially relevant for traffic flow management. The proposed model is flexible and extensible and offers a benchmark for evaluating tools and techniques developed for hybrid systems.

SYMay 28, 2018
Reachability Analysis for Robustness Evaluation of the Sit-To-Stand Movement for Powered Lower Limb Orthoses

Octavio Narvaez-Aroche, Pierre-Jean Meyer, Murat Arcak et al.

A sensitivity-based approach for computing over-approximations of reachable sets, in the presence of constant parameter uncertainties and a single initial state, is used to analyze a three-link planar robot modeling a Powered Lower Limb Orthosis and its user. Given the nature of the mappings relating the state and parameters of the system with the inputs, and outputs describing the trajectories of its Center of Mass, reachable sets for their respective spaces can be obtained relying on the sensitivities of the nonlinear closed-loop dynamics in the state space. These over-approximations are used to evaluate the worst-case performances of a finite time horizon linear-quadratic regulator (LQR) for controlling the ascending phase of the Sit-To-Stand movement.

SYSep 29, 2017
Small Satellite Constellation Separation using Linear Programming based Differential Drag Commands

Emmanuel Sin, Murat Arcak, Andrew Packard

We study the optimal control of an arbitrarily large constellation of small satellites operating in low Earth orbit. Simulating the lack of on-board propulsion, we limit our actuation to the use of differential drag maneuvers to make in-plane changes to the satellite orbits. We propose an efficient method to separate a cluster of satellites into a desired constellation shape while respecting actuation constraints and maximizing the operational lifetime of the constellation. By posing the problem as a linear program, we solve for the optimal drag commands for each of the satellites on a daily basis with a shrinking-horizon model predictive control approach. We then apply this control strategy in a nonlinear orbital dynamics simulation with a simple, varying atmospheric density model. We demonstrate the ability to control a cluster of 100+ satellites starting at the same initial conditions in a circular low Earth orbit to form an equally spaced constellation (with a relative angular separation error tolerance of one-tenth a degree). The constellation separation task can be executed in 71 days, a time frame that is competitive for the state-of-the-practice. This method allows us to trade the time required to converge to the desired constellation with a sacrifice in the overall constellation lifetime, measured as the maximum altitude loss experienced by one of the satellites in the group after the separation maneuvers.

SYMay 24, 2018
Finite Time Robust Control of the Sit-to-Stand Movement for Powered Lower Limb Orthoses

Octavio Narvaez-Aroche, Andrew Packard, Murat Arcak

This study presents a technique to safely control the Sit-to-Stand movement of powered lower limb orthoses in the presence of parameter uncertainty. The weight matrices used to calculate the finite time horizon linear-quadratic regulator (LQR) gain in the feedback loop are chosen from a pool of candidates as to minimize a robust performance metric involving induced gains that measure the deviation of variables of interest in a linear time-varying (LTV) system, at specific times within a finite horizon, caused by a perturbation signal modeling the variation of the parameters. Two relevant Sit-to-Stand movements are simulated for drawing comparisons with the results documented in a previous work.

SYMay 23, 2019
Flexible Computational Pipelines for Robust Abstraction-Based Control Synthesis

Eric S. Kim, Murat Arcak, Sanjit A. Seshia

Successfully synthesizing controllers for complex dynamical systems and specifications often requires leveraging domain knowledge as well as making difficult computational or mathematical tradeoffs. This paper presents a flexible and extensible framework for constructing robust control synthesis algorithms and applies this to the traditional abstraction-based control synthesis pipeline. It is grounded in the theory of relational interfaces and provides a principled methodology to seamlessly combine different techniques (such as dynamic precision grids, refining abstractions while synthesizing, or decomposed control predecessors) or create custom procedures to exploit an application's intrinsic structural properties. A Dubins vehicle is used as a motivating example to showcase memory and runtime improvements.

SYJul 26, 2018
Abstractions for Symbolic Controller Synthesis are Composable

Eric S. Kim, Murat Arcak

Translating continuous control system models into finite automata allows us to use powerful discrete tools to synthesize controllers for complex specifications. The abstraction construction step is unfortunately hamstrung by high runtime and memory requirements for high dimensional systems. This paper describes how the transition relation encoding the abstract system dynamics can be generated by connecting smaller abstract modules in series and parallel. We provide a composition operation and show that composing a collection of abstract modules yields another abstraction satisfying a feedback refinement relation. Through compositionality we circumvent the acute computational cost of directly abstracting a high dimensional system and also modularize the abstraction construction pipeline.

SYApr 12, 2018
Automatic Generation of Communication Requirements for Enforcing Multi-Agent Safety

Eric S. Kim, Murat Arcak, Sanjit A. Seshia et al.

Distributed controllers are often necessary for a multi-agent system to satisfy safety properties such as collision avoidance. Communication and coordination are key requirements in the implementation of a distributed control protocol, but maintaining an all-to-all communication topology is unreasonable and not always necessary. Given a safety objective and a controller implementation, we consider the problem of identifying when agents need to communicate with one another and coordinate their actions to satisfy the safety constraint. We define a coordination-free controllable predecessor operator that is used to derive a subset of the state space that allows agents to act independently, without consulting other agents to double check that the action is safe. Applications are shown for identifying an upper bound on connection delays and a self-triggered coordination scheme. Examples are provided which showcase the potential for designers to visually interpret a system's ability to tolerate delays when initializing a network connection.

SYNov 15, 2017
Contraction-based Observers using non-Euclidean Norms with an Application to Traffic Networks

Samuel Coogan, Murat Arcak

In this note, we study Luenberger-type full-state observers for nonlinear systems using contraction theory. We show that if the matrix measure of a suitably defined Jacobian matrix constructed from the dynamics of the system-observer interconnection is uniformly negative, then the state estimate converges exponentially to the actual state. This sufficient condition for convergence establishes that the distance between the estimate and state is infinitesimally contracting with respect to some norm on the state-space. In contrast to existing results for contraction-based observer design, we allow for contraction with respect to non-Euclidean norms. Such norms have proven useful in applications. To demonstrate our results, we study the problem of observing vehicular traffic density along a freeway modeled as interconnected, spatially homogenous compartments, and our approach relies on establishing contraction of the system-observer interconnection with respect to the one-norm.

2.7ROApr 21
Multi-Step Gaussian Process Propagation for Adaptive Path Planning

Alex Beaudin, Bjørn Andreas Kristiansen, Kristoffer Gryte et al.

Efficient and robust path planning hinges on combining all accessible information sources. In particular, the task of path planning for robotic environmental exploration and monitoring depends highly on the current belief of the world. To capture the uncertainty in the belief, we present a Gaussian process based path planning method that adapts to multi-modal environmental sensing data and incorporates state and input constraints. To solve the path planning problem, we optimize over future waypoints in a receding horizon fashion, and our cost is thus a function of the Gaussian process posterior over all these waypoints. We demonstrate this method, dubbed OLAhGP, on an autonomous surface vessel using oceanic algal bloom data from both a high-fidelity model and in-situ sensing data in a monitoring scenario. Our simulated and experimental results demonstrate significant improvement over existing methods. With the same number of samples, our method generates more informative paths and achieves greater accuracy in identifying algal blooms in chlorophyll a rich waters, measured with respect to total misclassification probability and binary misclassification rate over the domain of interest.

SYDec 12, 2025
Congestion Reduction in EV Charger Placement Using Traffic Equilibrium Models

Semih Kara, Yasin Sonmez, Can Kizilkale et al.

Growing EV adoption can worsen traffic conditions if chargers are sited without regard to their impact on congestion. We study how to strategically place EV chargers to reduce congestion using two equilibrium models: one based on congestion games and one based on an atomic queueing simulation. We apply both models within a scalable greedy station-placement algorithm. Experiments show that this greedy scheme yields optimal or near-optimal congestion outcomes in realistic networks, even though global optimality is not guaranteed as we show with a counterexample. We also show that the queueing-based approach yields more realistic results than the congestion-game model, and we present a unified methodology that calibrates congestion delays from queue simulation and solves equilibrium in link-space.

25.4SYApr 19
Controlled Invariant Sets for Gaussian Process State Space Models

Paul Griffioen, Bingzhuo Zhong, Murat Arcak et al.

We compute probabilistic controlled invariant sets for nonlinear systems using Gaussian process state space models, which are data-driven models that account for unmodeled and unknown nonlinear dynamics. We propose a semidefinite programming scheme for designing state-feedback controllers that maximize the probability of the trajectories staying within a probabilistic controlled invariant set while satisfying input constraints. The results are validated on a quadrotor, both in simulation and on a physical platform.

3.5SYApr 3
Probably Approximately Correct (PAC) Guarantees for Data-Driven Reachability Analysis: A Theoretical and Empirical Comparison

Elizabeth Dietrich, Hanna Krasowski, Murat Arcak

Reachability analysis evaluates system safety, by identifying the set of states a system may evolve within over a finite time horizon. In contrast to model-based reachability analysis, data-driven reachability analysis estimates reachable sets and derives probabilistic guarantees directly from data. Several popular techniques for validating reachable sets -- conformal prediction, scenario optimization, and the holdout method -- admit similar Probably Approximately Correct (PAC) guarantees. We establish a formal connection between these PAC bounds and present an empirical case study on reachable sets to illustrate the computational and sample trade-offs associated with these methods. We argue that despite the formal relationship between these techniques, subtle differences arise in both the interpretation of guarantees and the parameterization. As a result, these methods are not generally interchangeable. We conclude with practical advice on the usage of these methods.

17.4SYApr 3
Importance Sampling for Statistical Certification of Viable Initial Sets

Elizabeth Dietrich, Hanna Krasowski, Vegard Flovik et al.

We study the problem of statistically certifying viable initial sets (VISs) -- sets of initial conditions whose trajectories satisfy a given control specification. While VISs can be obtained from model-based methods, these methods typically rely on simplified models. We propose a simulation-based framework to certify VISs by estimating the probability of specification violations under a high-fidelity or black-box model. Since detecting these violations may be challenging due to their scarcity, we propose a sample-efficient framework that leverages importance sampling to target high-risk regions. We derive an empirical Bernstein inequality for weighted random variables, enabling finite-sample guarantees for importance sampling estimators. We demonstrate the effectiveness of the proposed approach on two systems and show improved convergence of the resulting bounds on an Adaptive Cruise Control benchmark.

SYSep 13, 2024
Stability Margins of Neural Network Controllers

Neelay Junnarkar, Murat Arcak, Peter Seiler

We present a method to train neural network controllers with guaranteed stability margins. The method is applicable to linear time-invariant plants interconnected with uncertainties and nonlinearities that are described by integral quadratic constraints. The type of stability margin we consider is the disk margin. Our training method alternates between a training step to maximize reward and a stability margin-enforcing step. In the stability margin enforcing-step, we solve a semidefinite program to project the controller into the set of controllers for which we can certify the desired disk margin.

24.1OCApr 20
Target Mirror Descent: A Unifying Framework for Solving Monotone Variational Inequalities

Yu-Wen Chen, Can Kizilkale, Murat Arcak

It is well known that mirror descent may diverge or cycle on merely monotone variational inequalities. In this paper, we propose \emph{Target Mirror Descent} (TMD), a unified framework that stabilizes monotone flows via a target point correction mechanism in the dual update. By appropriate design choices, TMD recovers the proximal point algorithm, extragradient methods, splitting methods, Brown-von Neumann-Nash dynamics, forward-backward-forward dynamics, and discounted mirror descent as special cases. Thus, we establish a unified perspective on these landmark algorithms and their convergence. Beyond unification, we leverage the TMD framework to correct an equilibrium misalignment in discounted mirror descent and to generalize its higher-order extension beyond interior solutions. Moreover, a key structural feature of TMD is the explicit decoupling of the mirror map from the target determination, which enables \emph{geometric ensembles}: multiple algorithms solve the same problem in parallel using distinct mirror maps, while sharing a common dual update. We show that such an ensemble rigorously reduces to a single TMD with a synthesized mirror map, and thus inherits these convergence guarantees.

77.8SYApr 30
Hierarchical Control for Continuous-time Systems via General Approximate Alternating Simulation Relations

Zhiyuan Huang, Shuo Li, Murat Arcak et al.

This paper introduces a general approximate alternating simulation relation (\emph{$\varepsilon$-gAAS relation}) for continuous-time systems, which relaxes existing simulation relations to tolerate larger mismatches between abstract and concrete models. The definition of gAAS for continuous-time systems is first proposed, and its properties are investigated. Then, a control refinement method is developed to enable hierarchical control for the gAAS relation. Finally, case studies demonstrate the effectiveness of the proposed approach, highlighting its advantages over existing methods.

SYApr 10, 2024
Synthesizing Neural Network Controllers with Closed-Loop Dissipativity Guarantees

Neelay Junnarkar, Murat Arcak, Peter Seiler

In this paper, a method is presented to synthesize neural network controllers such that the feedback system of plant and controller is dissipative, certifying performance requirements such as L2 gain bounds. The class of plants considered is that of linear time-invariant (LTI) systems interconnected with an uncertainty, including nonlinearities treated as an uncertainty for convenience of analysis. The uncertainty of the plant and the nonlinearities of the neural network are both described using integral quadratic constraints (IQCs). First, a dissipativity condition is derived for uncertain LTI systems. Second, this condition is used to construct a linear matrix inequality (LMI) which can be used to synthesize neural network controllers. Finally, this convex condition is used in a projection-based training method to synthesize neural network controllers with dissipativity guarantees. Numerical examples on an inverted pendulum and a flexible rod on a cart are provided to demonstrate the effectiveness of this approach.

LGMar 27, 2024
Exploiting Symmetry in Dynamics for Model-Based Reinforcement Learning with Asymmetric Rewards

Yasin Sonmez, Neelay Junnarkar, Murat Arcak

Recent work in reinforcement learning has leveraged symmetries in the model to improve sample efficiency in training a policy. A commonly used simplifying assumption is that the dynamics and reward both exhibit the same symmetry; however, in many real-world environments, the dynamical model exhibits symmetry independent of the reward model. In this paper, we assume only the dynamics exhibit symmetry, extending the scope of problems in reinforcement learning and learning in control theory to which symmetry techniques can be applied. We use Cartan's moving frame method to introduce a technique for learning dynamics that, by construction, exhibit specified symmetries. Numerical experiments demonstrate that the proposed method learns a more accurate dynamical model

SYOct 8, 2025
Falsification-Driven Reinforcement Learning for Maritime Motion Planning

Marlon Müller, Florian Finkeldei, Hanna Krasowski et al.

Compliance with maritime traffic rules is essential for the safe operation of autonomous vessels, yet training reinforcement learning (RL) agents to adhere to them is challenging. The behavior of RL agents is shaped by the training scenarios they encounter, but creating scenarios that capture the complexity of maritime navigation is non-trivial, and real-world data alone is insufficient. To address this, we propose a falsification-driven RL approach that generates adversarial training scenarios in which the vessel under test violates maritime traffic rules, which are expressed as signal temporal logic specifications. Our experiments on open-sea navigation with two vessels demonstrate that the proposed approach provides more relevant training scenarios and achieves more consistent rule compliance.

38.2SYApr 1
Polynomial Constraints for Robustness Analysis of Nonlinear Systems

Neelay Junnarkar, Peter Seiler, Murat Arcak

This paper presents a framework for abstracting uncertain or non-polynomial components of dynamical systems using polynomial constraints. This enables the application of polynomial-based analysis tools, such as sum-of-squares programming, to a broader class of non-polynomial systems. A numerical method for constructing these constraints is proposed. The relationship between polynomial constraints and existing integral quadratic constraints (IQCs) is investigated, providing transformations of IQCs into polynomial constraints. The effectiveness of polynomial constraints in characterizing nonlinearities is validated via numerical examples to compute inner estimates of the region of attraction for two systems.

25.8SYApr 1
Learning Neural Network Controllers with Certified Robust Performance via Adversarial Training

Neelay Junnarkar, Yasin Sonmez, Murat Arcak

Neural network (NN) controllers achieve strong empirical performance on nonlinear dynamical systems, yet deploying them in safety-critical settings requires robustness to disturbances and uncertainty. We present a method for jointly synthesizing NN controllers and dissipativity certificates that formally guarantee robust closed-loop performance using adversarial training, in which we use counterexamples to the robust dissipativity condition to guide training. Verification is done post-training using alpha,beta-CROWN, a branch-and-bound-based method that enables direct analysis of the nonlinear dynamical system. The proposed method uses quadratic constraints (QCs) only for characterization of non-parametric uncertainties. The method is tested in numerical experiments on maximizing the volume of the set on which a system is certified to be robustly dissipative. Our method certifies regions up to 78 times larger than the region certified by a linear matrix inequality-based approach that we derive for comparison.

LGSep 5, 2025
STL-based Optimization of Biomolecular Neural Networks for Regression and Control

Eric Palanques-Tost, Hanna Krasowski, Murat Arcak et al.

Biomolecular Neural Networks (BNNs), artificial neural networks with biologically synthesizable architectures, achieve universal function approximation capabilities beyond simple biological circuits. However, training BNNs remains challenging due to the lack of target data. To address this, we propose leveraging Signal Temporal Logic (STL) specifications to define training objectives for BNNs. We build on the quantitative semantics of STL, enabling gradient-based optimization of the BNN weights, and introduce a learning algorithm that enables BNNs to perform regression and control tasks in biological systems. Specifically, we investigate two regression problems in which we train BNNs to act as reporters of dysregulated states, and a feedback control problem in which we train the BNN in closed-loop with a chronic disease model, learning to reduce inflammation while avoiding adverse responses to external infections. Our numerical experiments demonstrate that STL-based learning can solve the investigated regression and control tasks efficiently.

OCJul 30, 2025
Set Invariance with Probability One for Controlled Diffusion: Score-based Approach

Wenqing Wang, Alexis M. H. Teter, Murat Arcak et al.

Given a controlled diffusion and a connected, bounded, Lipschitz set, when is it possible to guarantee controlled set invariance with probability one? In this work, we answer this question by deriving the necessary and sufficient conditions for the same in terms of gradients of certain log-likelihoods -- a.k.a. score vector fields -- for two cases: given finite time horizon and infinite time horizon. The deduced conditions comprise a score-based test that provably certifies or falsifies the existence of Markovian controllers for given controlled set invariance problem data. Our results are constructive in the sense when the problem data passes the proposed test, we characterize all controllers guaranteeing the desired set invariance. When the problem data fails the proposed test, there does not exist a controller that can accomplish the desired set invariance with probability one. The computation in the proposed tests involve solving certain Dirichlet boundary value problems, and in the finite horizon case, can also account for additional constraint of hitting a target subset at the terminal time. We illustrate the results using several semi-analytical and numerical examples.

ROMar 8, 2025
Learning to Drive by Imitating Surrounding Vehicles

Yasin Sonmez, Hanna Krasowski, Murat Arcak

Imitation learning is a promising approach for training autonomous vehicles (AV) to navigate complex traffic environments by mimicking expert driver behaviors. While existing imitation learning frameworks focus on leveraging expert demonstrations, they often overlook the potential of additional complex driving data from surrounding traffic participants. In this paper, we study a data augmentation strategy that leverages the observed trajectories of nearby vehicles, captured by the AV's sensors, as additional demonstrations. We introduce a simple vehicle-selection sampling and filtering strategy that prioritizes informative and diverse driving behaviors, contributing to a richer dataset for training. We evaluate this idea with a representative learning-based planner on a large real-world dataset and demonstrate improved performance in complex driving scenarios. Specifically, the approach reduces collision rates and improves safety metrics compared to the baseline. Notably, even when using only 10 percent of the original dataset, the method matches or exceeds the performance of the full dataset. Through ablations, we analyze selection criteria and show that naive random selection can degrade performance. Our findings highlight the value of leveraging diverse real-world trajectory data in imitation learning and provide insights into data augmentation strategies for autonomous driving.

LGMay 17, 2023
Exact Recovery for System Identification with More Corrupt Data than Clean Data

Baturalp Yalcin, Haixiang Zhang, Javad Lavaei et al.

This paper investigates the system identification problem for linear discrete-time systems under adversaries and analyzes two lasso-type estimators. We examine both asymptotic and non-asymptotic properties of these estimators in two separate scenarios, corresponding to deterministic and stochastic models for the attack times. Since the samples collected from the system are correlated, the existing results on lasso are not applicable. We prove that when the system is stable and attacks are injected periodically, the sample complexity for exact recovery of the system dynamics is linear in terms of the dimension of the states. When adversarial attacks occur at each time instance with probability p, the required sample complexity for exact recovery scales polynomially in the dimension of the states and the probability p. This result implies almost sure convergence to the true system dynamics under the asymptotic regime. As a by-product, our estimators still learn the system correctly even when more than half of the data is compromised. We highlight that the attack vectors are allowed to be correlated with each other in this work, whereas we make some assumptions about the times at which the attacks happen. This paper provides the first mathematical guarantee in the literature on learning from correlated data for dynamical systems in the case when there is less clean data than corrupt data.

SYMar 31, 2022
Synthesis of Stabilizing Recurrent Equilibrium Network Controllers

Neelay Junnarkar, He Yin, Fangda Gu et al.

We propose a parameterization of a nonlinear dynamic controller based on the recurrent equilibrium network, a generalization of the recurrent neural network. We derive constraints on the parameterization under which the controller guarantees exponential stability of a partially observed dynamical system with sector bounded nonlinearities. Finally, we present a method to synthesize this controller using projected policy gradient methods to maximize a reward function with arbitrary structure. The projection step involves the solution of convex optimization problems. We demonstrate the proposed method with simulated examples of controlling nonlinear plants, including plants modeled with neural networks.

SYDec 18, 2021
Data-Driven Reachability analysis and Support set Estimation with Christoffel Functions

Alex Devonport, Forest Yang, Laurent El Ghaoui et al.

We present algorithms for estimating the forward reachable set of a dynamical system using only a finite collection of independent and identically distributed samples. The produced estimate is the sublevel set of a function called an empirical inverse Christoffel function: empirical inverse Christoffel functions are known to provide good approximations to the support of probability distributions. In addition to reachability analysis, the same approach can be applied to general problems of estimating the support of a random variable, which has applications in data science towards detection of novelties and outliers in data sets. In applications where safety is a concern, having a guarantee of accuracy that holds on finite data sets is critical. In this paper, we prove such bounds for our algorithms under the Probably Approximately Correct (PAC) framework. In addition to applying classical Vapnik-Chervonenkis (VC) dimension bound arguments, we apply the PAC-Bayes theorem by leveraging a formal connection between kernelized empirical inverse Christoffel functions and Gaussian process regression models. The bound based on PAC-Bayes applies to a more general class of Christoffel functions than the VC dimension argument, and achieves greater sample efficiency in experiments.

SYSep 8, 2021
Recurrent Neural Network Controllers Synthesis with Stability Guarantees for Partially Observed Systems

Fangda Gu, He Yin, Laurent El Ghaoui et al.

Neural network controllers have become popular in control tasks thanks to their flexibility and expressivity. Stability is a crucial property for safety-critical dynamical systems, while stabilization of partially observed systems, in many cases, requires controllers to retain and process long-term memories of the past. We consider the important class of recurrent neural networks (RNN) as dynamic controllers for nonlinear uncertain partially-observed systems, and derive convex stability conditions based on integral quadratic constraints, S-lemma and sequential convexification. To ensure stability during the learning and control process, we propose a projected policy gradient method that iteratively enforces the stability conditions in the reparametrized space taking advantage of mild additional information on system dynamics. Numerical experiments show that our method learns stabilizing controllers while using fewer samples and achieving higher final performance compared with policy gradient.

AIApr 28, 2021
Symbolic Abstractions From Data: A PAC Learning Approach

Alex Devonport, Adnane Saoud, Murat Arcak

Symbolic control techniques aim to satisfy complex logic specifications. A critical step in these techniques is the construction of a symbolic (discrete) abstraction, a finite-state system whose behaviour mimics that of a given continuous-state system. The methods used to compute symbolic abstractions, however, require knowledge of an accurate closed-form model. To generalize them to systems with unknown dynamics, we present a new data-driven approach that does not require closed-form dynamics, instead relying only the ability to evaluate successors of each state under given inputs. To provide guarantees for the learned abstraction, we use the Probably Approximately Correct (PAC) statistical framework. We first introduce a PAC-style behavioural relationship and an appropriate refinement procedure. We then show how the symbolic abstraction can be constructed to satisfy this new behavioural relationship. Moreover, we provide PAC bounds that dictate the number of data required to guarantee a prescribed level of accuracy and confidence. Finally, we present an illustrative example.

SYSep 30, 2020
Co-design of Control and Planning for Multi-rotor UAVs with Signal Temporal Logic Specifications

Yash Vardhan Pant, He Yin, Murat Arcak et al.

Urban Air Mobility (UAM), or the scenario where multiple manned and Unmanned Aerial Vehicles (UAVs) carry out various tasks over urban airspaces, is a transportation concept of the future that is gaining prominence. UAM missions with complex spatial, temporal and reactive requirements can be succinctly represented using Signal Temporal Logic (STL), a behavioral specification language. However, planning and control of systems with STL specifications is computationally intensive, usually resulting in planning approaches that do not guarantee dynamical feasibility, or control approaches that cannot handle complex STL specifications. Here, we present an approach to co-design the planner and control such that a given STL specification (possibly over multiple UAVs) is satisfied with trajectories that are dynamically feasible and our controller can track them with a bounded tracking-error that the planner accounts for. The tracking controller is formulated for the non-linear dynamics of the individual UAVs, and the tracking error bound is computed for this controller when the trajectories satisfy some kinematic constraints. We also augment an existing multi-UAV STL-based trajectory generator in order to generate trajectories that satisfy such constraints. We show that this co-design allows for trajectories that satisfy a given STL specification, and are also dynamically feasible in the sense that they can be tracked with bounded error. The applicability of this approach is demonstrated through simulations of multi-UAV missions.

SYSep 30, 2018
Finite Horizon Backward Reachability Analysis and Control Synthesis for Uncertain Nonlinear Systems

He Yin, Andrew Packard, Murat Arcak et al.

We present a method for synthesizing controllers to steer trajectories from an initial set to a target set on a finite time horizon. The proposed control synthesis problem is decomposed into two steps. The first step under-approximates the backward reachable set (BRS) from the target set, using level sets of storage functions. The storage function is constructed with an iterative algorithm to maximize the volume of the under-approximated BRS. The second step obtains a control law by solving a pointwise min-norm optimization problem using the pre-computed storage function. A closed-form solution of this min-norm optimization can be computed through the KKT conditions. This control synthesis framework is then extended to uncertain nonlinear systems with parametric uncertainties and L_2 disturbances. The computation algorithm for all cases is derived using sum-of-squares (SOS) programming and the S-procedure. The proposed method is applied to several robotics and aircraft examples.

SYAug 26, 2015
Compositional Performance Certification of Interconnected Systems using ADMM

Chris Meissen, Laurent Lessard, Murat Arcak et al.

A compositional performance certification method is presented for interconnected systems using subsystem dissipativity properties and the interconnection structure. A large-scale optimization problem is formulated to search for the most relevant dissipativity properties. The alternating direction method of multipliers (ADMM) is employed to decompose and solve this problem, and is demonstrated on several examples.

SYMay 22, 2015
A Compartmental Model for Traffic Networks and its Dynamical Behavior

Samuel Coogan, Murat Arcak

We propose a macroscopic traffic network flow model suitable for analysis as a dynamical system, and we qualitatively analyze equilibrium flows as well as convergence. Flows at a junction are determined by downstream supply of capacity as well as upstream demand of traffic wishing to flow through the junction. This approach is rooted in the celebrated Cell Transmission Model for freeway traffic flow. Unlike related results which rely on certain system cooperativity properties, our model generally does not possess these properties. We show that the lack of cooperativity is in fact a useful feature that allows traffic control methods, such as ramp metering, to be effective. Finally, we leverage the results of the paper to develop a linear program for optimal ramp metering.