Arman Sharifi Kolarijani

h-index27
5papers

5 Papers

OCSep 23, 2019
Continuous-Time Accelerated Methods via a Hybrid Control Lens

Arman Sharifi Kolarijani, Peyman Mohajerin Esfahani, Tamás Keviczky

Treating optimization methods as dynamical systems can be traced back centuries ago in order to comprehend the notions and behaviors of optimization methods. Lately, this mind set has become the driving force to design new optimization methods. Inspired by the recent dynamical system viewpoint of Nesterov's fast method, we propose two classes of fast methods, formulated as hybrid control systems, to obtain pre-specified exponential convergence rate. Alternative to the existing fast methods which are parametric-in-time second order differential equations, we dynamically synthesize feedback controls in a state-dependent manner. Namely, in the first class the damping term is viewed as the control input, while in the second class the amplitude with which the gradient of the objective function impacts the dynamics serves as the controller. The objective function requires to satisfy the so-called Polyak--Łojasiewicz inequality which effectively implies no local optima and a certain gradient-domination property. Moreover, we establish that both hybrid structures possess Zeno-free solution trajectories. We finally provide a mechanism to determine the discretization step size to attain an exponential convergence rate.

OCMay 1, 2022
Adaptive Composite Online Optimization: Predictions in Static and Dynamic Environments

Pedro Zattoni Scroccaro, Arman Sharifi Kolarijani, Peyman Mohajerin Esfahani

In the past few years, Online Convex Optimization (OCO) has received notable attention in the control literature thanks to its flexible real-time nature and powerful performance guarantees. In this paper, we propose new step-size rules and OCO algorithms that simultaneously exploit gradient predictions, function predictions and dynamics, features particularly pertinent to control applications. The proposed algorithms enjoy static and dynamic regret bounds in terms of the dynamics of the reference action sequence, gradient prediction error, and function prediction error, which are generalizations of known regularity measures from the literature. We present results for both convex and strongly convex costs. We validate the performance of the proposed algorithms in a trajectory tracking case study, as well as portfolio optimization using real-world datasets.

OCMay 3, 2025
Rank-One Modified Value Iteration

Arman Sharifi Kolarijani, Tolga Ok, Peyman Mohajerin Esfahani et al.

In this paper, we provide a novel algorithm for solving planning and learning problems of Markov decision processes. The proposed algorithm follows a policy iteration-type update by using a rank-one approximation of the transition probability matrix in the policy evaluation step. This rank-one approximation is closely related to the stationary distribution of the corresponding transition probability matrix, which is approximated using the power method. We provide theoretical guarantees for the convergence of the proposed algorithm to optimal (action-)value function with the same rate and computational complexity as the value iteration algorithm in the planning problem and as the Q-learning algorithm in the learning problem. Through our extensive numerical simulations, however, we show that the proposed algorithm consistently outperforms first-order algorithms and their accelerated versions for both planning and learning problems.

SYSep 12, 2016
Technical Report: Timing Abstraction of Perturbed LTI systems with $\mathcal{L}_2$-based Event-Triggering Mechanism

Arman Sharifi Kolarijani, Manuel Mazo, Tamas Keviczky

In networked control systems, the advent of event-triggering strategies in the sampling process has resulted in the usage reduction of network capacities, such as communication bandwidth. However, the aperiodic nature of sampling periods generated by event-triggering strategies has hindered the schedulability of such networks. In this study, we propose a framework to construct a timed safety automaton that captures the sampling behavior of perturbed LTI systems with an $\mathcal{L}_2$-based triggering mechanisms proposed in the Literature. In this framework, the state-space is partitioned into a finite number of convex polyhedral cones, each cone representing a discrete mode in the abstracted automaton. Adopting techniques from stability analysis of retarded systems accompanied with a polytopic embedding of time, LMI conditions to characterize the sampling interval associated with each region are derived. Then, using reachability analysis, the transitions in the abstracted automaton are derived.

SYJun 15, 2015
A Formal Traffic Characterization of LTI Event-triggered Control Systems

Arman Sharifi Kolarijani, Manuel Mazo

Unnecessary communication and computation in the periodic execution of control tasks lead to over-provisioning in hardware design (or underexploitation in hardware utilization) in control applications, such as networked control systems. To address these issues, researchers have proposed a new class of strategies, named event-driven strategies. Despite of their beneficiary effects, matters like task scheduling and appropriate dimensioning of communication components have become more complicated with respect to traditional periodic strategies. In this paper, we present a formal approach to derive an abstracted system that captures the sampling behavior of a family of event-triggered strategies for the case of LTI systems. This structure approximately simulates the sampling behavior of the aperiodic control system. Furthermore, the resulting quotient system is equivalent to a timed automaton. In the construction of the abstraction, the state space is confined to a finite number of convex regions, each of which represents a mode in the quotient system. An LMI-based technique is deployed to derive a sampling time interval associated to each region. Finally, reachability analysis is leveraged to find the transitions of the quotient system.