Raman Goyal

LG
6papers
31citations
Novelty53%
AI Score25

6 Papers

SYJan 31, 2023
Benchmarking Model Predictive Control Algorithms in Building Optimization Testing Framework (BOPTEST)

Saman Mostafavi, Chihyeon Song, Aayushman Sharma et al.

We present a data-driven modeling and control framework for physics-based building emulators. Our approach consists of: (a) Offline training of differentiable surrogate models that accelerate model evaluations, provide cost-effective gradients, and maintain good predictive accuracy for the receding horizon in Model Predictive Control (MPC), and (b) Formulating and solving nonlinear building HVAC MPC problems. We extensively evaluate the modeling and control performance using multiple surrogate models and optimization frameworks across various test cases available in the Building Optimization Testing Framework (BOPTEST). Our framework is compatible with other modeling techniques and can be customized with different control formulations, making it adaptable and future-proof for test cases currently under development for BOPTEST. This modularity provides a path towards prototyping predictive controllers in large buildings, ensuring scalability and robustness in real-world applications.

ROJul 16, 2021
An Information-state based Approach to the Optimal Output Feedback Control of Nonlinear Systems

Raman Goyal, Ran Wang, Mohamed Naveed Gul Mohamed et al.

This paper develops a data-based approach to the closed-loop output feedback control of nonlinear dynamical systems with a partial nonlinear observation model. We propose an information state based approach to rigorously transform the partially observed problem into a fully observed problem where the information state consists of the past several observations and control inputs. We further show the equivalence of the transformed and the initial partially observed optimal control problems and provide the conditions to solve for the deterministic optimal solution. We develop a data based generalization of the iterative Linear Quadratic Regulator (iLQR) to partially observed systems using a local linear time varying model of the information state dynamics approximated by an Autoregressive moving average (ARMA) model, that is generated using only the input-output data. This open-loop trajectory optimization solution is then used to design a local feedback control law, and the composite law then provides an optimum solution to the partially observed feedback design problem. The efficacy of the developed method is shown by controlling complex high dimensional nonlinear dynamical systems in the presence of model and sensing uncertainty.

LGNov 21, 2020
On the Convergence of Reinforcement Learning in Nonlinear Continuous State Space Problems

Raman Goyal, Suman Chakravorty, Ran Wang et al.

We consider the problem of Reinforcement Learning for nonlinear stochastic dynamical systems. We show that in the RL setting, there is an inherent ``Curse of Variance" in addition to Bellman's infamous ``Curse of Dimensionality", in particular, we show that the variance in the solution grows factorial-exponentially in the order of the approximation. A fundamental consequence is that this precludes the search for anything other than ``local" feedback solutions in RL, in order to control the explosive variance growth, and thus, ensure accuracy. We further show that the deterministic optimal control has a perturbation structure, in that the higher order terms do not affect the calculation of lower order terms, which can be utilized in RL to get accurate local solutions.

RONov 7, 2020
Robust Shape Control of Gyroscopic Tensegrity Robotic Arm

Raman Goyal, Manoranjan Majji, Robert E. Skelton

This paper proposes a model-based approach to control the shape of a tensegrity system by driving its node position locations. The nonlinear dynamics of the tensegrity system is used to regulate position, velocity, and acceleration to the specified reference trajectory. State feedback control design is used to obtain the solution for the control variable as a linear programming problem. Shape control for the gyroscopic tensegrity systems is discussed, and it is observed that these systems increase the reachable space for the structure by providing independent control over certain rotational degrees of freedom. Disturbance rejection of the tensegrity system is further studied in the paper. A methodology to calculate the control gains to bound the errors for five different types of problems is provided. The formulation uses a Linear Matrix Inequality (LMI) approach to stipulate the desired performance bounds on the error for $\mathcal{H}_\infty$, generalized $\mathcal{H}_2$, LQR, covariance control and stabilizing control problem. A high degree of freedom tensegrity $T_2D_1$ robotic arm is used as an example to show the efficacy of the formulation.

SYApr 1, 2020
On the Feedback Law in Stochastic Optimal Nonlinear Control

Mohamed Naveed Gul Mohamed, Suman Chakravorty, Raman Goyal et al.

We consider the problem of nonlinear stochastic optimal control. This problem is thought to be fundamentally intractable owing to Bellman's "curse of dimensionality". We present a result that shows that repeatedly solving an open-loop deterministic problem from the current state with progressively shorter horizons, similar to Model Predictive Control (MPC), results in a feedback policy that is $O(ε^4)$ near to the true global stochastic optimal policy, where $ε$ is a perturbation parameter modulating the noise. We also show that the optimal deterministic feedback problem has a perturbation structure such that higher-order terms of the feedback law do not affect lower-order terms and that this structure is lost in the optimal stochastic feedback problem. Consequently, solving the Stochastic Dynamic Programming problem is highly susceptible to noise, even in low dimensional problems, and in practice, the MPC-type feedback law offers superior performance even for high noise levels.

LGFeb 21, 2020
On the Search for Feedback in Reinforcement Learning

Ran Wang, Karthikeya S. Parunandi, Aayushman Sharma et al.

The problem of Reinforcement Learning (RL) in an unknown nonlinear dynamical system is equivalent to the search for an optimal feedback law utilizing the simulations/ rollouts of the dynamical system. Most RL techniques search over a complex global nonlinear feedback parametrization making them suffer from high training times as well as variance. Instead, we advocate searching over a local feedback representation consisting of an open-loop sequence, and an associated optimal linear feedback law completely determined by the open-loop. We show that this alternate approach results in highly efficient training, the answers obtained are repeatable and hence reliable, and the resulting closed performance is superior to global state-of-the-art RL techniques. Finally, if we replan, whenever required, which is feasible due to the fast and reliable local solution, it allows us to recover global optimality of the resulting feedback law.