OCFeb 20, 2018
Sparsity Preserving Optimal Control of Discretized PDE SystemsAleksandar Haber, Michel Verhaegen
We focus on the problem of optimal control of large-scale systems whose models are obtained by discretization of partial differential equations using the Finite Element (FE) or Finite Difference (FD) methods. The motivation for studying this pressing problem originates from the fact that the classical numerical tools used to solve low-dimensional optimal control problems are computationally infeasible for large-scale systems. Furthermore, although the matrices of large-scale FE or FD models are usually sparse banded or highly structured, the optimal control solution computed using the classical methods is dense and unstructured. Consequently, it is not suitable for efficient centralized and distributed real-time implementations. We show that the a priori (sparsity) patterns of the exact solutions of the generalized Lyapunov equations for FE and FD models are banded matrices. The a priori pattern predicts the dominant non-zero entries of the exact solution. We furthermore show that for well-conditioned problems, the a priori patterns are not only banded but also sparse matrices. On the basis of these results, we develop two computationally efficient methods for computing sparse approximate solutions of generalized Lyapunov equations. Using these two methods and the inexact Newton method, we show that the solution of the generalized Riccati equation can be approximated by a banded matrix. This enables us to develop a novel computationally efficient optimal control approach that is able to preserve the sparsity of the control law. We perform extensive numerical experiments that demonstrate the effectiveness of our approach.
SYFeb 8, 2016
Mean and variance of the LQG cost functionHildo Bijl, Jan Willem van Wingerden, Thomas B. Schön et al.
Linear Quadratic Gaussian (LQG) systems are well-understood and methods to minimize the expected cost are readily available. Less is known about the statistical properties of the resulting cost function. The contribution of this paper is a set of analytic expressions for the mean and variance of the LQG cost function. These expressions are derived using two different methods, one using solutions to Lyapunov equations and the other using only matrix exponentials. Both the discounted and the non-discounted cost function are considered, as well as the finite-time and the infinite-time cost function. The derived expressions are successfully applied to an example system to reduce the probability of the cost exceeding a given threshold.
SYFeb 24, 2016
Robust Air Data Sensor Fault Diagnosis With Enhanced Fault Sensitivity Using Moving Horizon EstimationYiming Wan, Tamas Keviczky, Michel Verhaegen
This paper investigates robust fault diagnosis of multiple air data sensor faults in the presence of winds. The trade-off between robustness to winds and sensitivity to faults is challenging due to simultaneous influence of winds and latent faults on monitored sensors. Different from conventional residual generators that do not consider any constraints, we propose a constrained residual generator using moving horizon estimation. The main contribution is improved fault sensitivity by exploiting known bounds on winds in residual generation. By analyzing the Karush-Kuhn-Tucker conditions of the formulated moving horizon estimation problem, it is shown that this improvement is attributed to active inequality constraints caused by faults. When the weighting matrices in the moving horizon estimation problem are tuned to increase robustness to winds, its fault sensitivity does not simply decrease as one would expect in conventional unconstrained residual generators. Instead, its fault sensitivity increases when the fault is large enough to activate some inequality constraints. This fault sensitivity improvement is not restricted to this particular application, but can be achieved by any general moving horizon estimation based residual generator. A high-fidelity Airbus simulator is used to illustrate the advantage of our proposed approach in terms of fault sensitivity.
SYNov 14, 2016
Gray Box Identification of State-Space Models Using Difference of Convex ProgrammingChengpu Yu, Lennart Ljung, Michel Verhaegen
Gray-box identification is prevalent in modeling physical and networked systems. However, due to the non-convex nature of the gray-box identification problem, good initial parameter estimates are crucial for a successful application. In this paper, a new identification method is proposed by exploiting the low-rank and structured Hankel matrix of impulse response. This identification problem is recasted into a difference-of-convex programming problem, which is then solved by the sequential convex programming approach with the associated initialization obtained by nuclear-norm optimization. The presented method aims to achieve the maximum impulse-response fitting while not requiring additional (non-convex) conditions to secure non-singularity of the similarity transformation relating the given state-space matrices to the gray-box parameterized ones. This overcomes a persistent shortcoming in a number of recent contributions on this topic, and the new method can be applied for the structured state-space realization even if the involved system parameters are unidentifiable. The method can be used both for directly estimating the gray-box parameters and for providing initial parameter estimates for further iterative search in a conventional gray-box identification setup.
SYDec 14, 2016
N2SID: Nuclear Norm Subspace IdentificationMichel Verhaegen, Anders Hansson
The identification of multivariable state space models in innovation form is solved in a subspace identification framework using convex nuclear norm optimization. The convex optimization approach allows to include constraints on the unknown matrices in the data-equation characterizing subspace identification methods, such as the lower triangular block-Toeplitz of weighting matrices constructed from the Markov parameters of the unknown observer. The classical use of instrumental variables to remove the influence of the innovation term on the data equation in subspace identification is avoided. The avoidance of the instrumental variable projection step has the potential to improve the accuracy of the estimated model predictions, especially for short data length sequences. This is illustrated using a data set from the DaSIy library. An efficient implementation in the framework of the Alternating Direction Method of Multipliers (ADMM) is presented that is used in the validation study.
SYFeb 12, 2017
Subspace Identification of Large-Scale 1D Homogeneous NetworksChengpu Yu, Michel Verhaegen, Anders Hansson
This paper considers the identification of large-scale 1D networks consisting of identical LTI dynamical systems. A new subspace identification method is developed that only uses local input-output information and does not rely on knowledge about the local state interaction. The identification of the local system matrices (up to a similarity transformation) is done via a low dimensional subspace retrieval step that enables the estimation of the Markov parameters of a locally lifted system. Using the estimated Markov parameters, the state-space realization of a single subsystem in the network is determined. The low dimensional subspace retrieval step exploits various key structural properties that are present in the data equation such as a low rank property and a {\em two-layer} Toeplitz structure in the data matrices constructed from products of the system matrices. For the estimation of the system matrices of a single subsystem, it is formulated as a structured low-rank matrix factorization problem. The effectiveness of the proposed identification method is demonstrated by a simulation example.
CVJun 3, 2021
Nonuniform Defocus Removal for Image ClassificationNguyen Hieu Thao, Oleg Soloviev, Jacques Noom et al.
We propose and study the single-frame anisoplanatic deconvolution problem associated with image classification using machine learning algorithms, named the nonuniform defocus removal (NDR) problem. Mathematical analysis of the NDR problem is done and the so-called defocus removal (DR) algorithm for solving it is proposed. Global convergence of the DR algorithm is established without imposing any unverifiable assumption. Numerical results on simulation data show significant features of DR including solvability, noise robustness, convergence, model insensitivity and computational efficiency. Physical relevance of the NDR problem and practicability of the DR algorithm are tested on experimental data. Back to the application that originally motivated the investigation of the NDR problem, we show that the DR algorithm can improve the accuracy of classifying distorted images using convolutional neural networks. The key difference of this paper compared to most existing works on single-frame anisoplanatic deconvolution is that the new method does not require the data image to be decomposable into isoplanatic subregions. Therefore, solution approaches partitioning the image into isoplanatic zones are not applicable to the NDR problem and those handling the entire image such as the DR algorithm need to be developed and analyzed.
SYOct 7, 2018
QUARKS: Identification of large-scale Kronecker Vector-AutoRegressive modelsBaptiste Sinquin, Michel Verhaegen
In this paper we propose a Kronecker-based modeling for identifying the spatial-temporal dynamics of large sensor arrays. The class of Kronecker networks is defined for which we formulate a Vector Autoregressive model. Its coefficient-matrices are decomposed into a sum of Kronecker products. For a two-dimensional array of size $N \times N$, and when the number of terms in the sum is small compared to $N$, exploiting the Kronecker structure leads to high data compression. We propose an Alternating Least Squares algorithm to identify the coefficient matrices with $\mathcal{O}(N^3N_t)$, where $N_t$ is the number of temporal samples, instead of $\mathcal{O}(N^6)$ in the unstructured case. This framework moreover allows for a convenient integration of more structure (e.g sparse, banded, Toeplitz) on the factor matrices. Numerical examples on atmospheric turbulence data has shown comparable performances with the unstructured least-squares estimation while the number of parameters is growing only linearly w.r.t. the number of nodes instead of quadratically in the full unstructured matrix case.
SYAug 29, 2017
Fault Estimation Filter Design with Guaranteed Stability Using Markov ParametersYiming Wan, Tamas Keviczky, Michel Verhaegen
For additive actuator and sensor faults, we propose a systematic method to design a state-space fault estimation filter directly from Markov parameters identified from fault-free data. We address this problem by parameterizing a system-inversion-based fault estimation filter with the identified Markov parameters. Even without building an explicit state-space plant model, our novel approach still allows the filter gain design for stabilization and suboptimal $\mathcal{H}_2$ performance. This design freedom cannot be achieved by other existing data-driven fault estimation filter designs so far. Another benefit of our proposed design is the convenience of determining the state order: a higher state order of the filter leads to better estimation performance, at the cost of heavier computational burden. In contrast, order determination is cumbersome when using an identified state-space plant model for the filter design, because of the complicated propagation of the model mismatch into the fault estimation errors. Simulations using an unstable aircraft system illustrate the effectiveness of the proposed new method.
MLApr 1, 2016
A sequential Monte Carlo approach to Thompson sampling for Bayesian optimizationHildo Bijl, Thomas B. Schön, Jan-Willem van Wingerden et al.
Bayesian optimization through Gaussian process regression is an effective method of optimizing an unknown function for which every measurement is expensive. It approximates the objective function and then recommends a new measurement point to try out. This recommendation is usually selected by optimizing a given acquisition function. After a sufficient number of measurements, a recommendation about the maximum is made. However, a key realization is that the maximum of a Gaussian process is not a deterministic point, but a random variable with a distribution of its own. This distribution cannot be calculated analytically. Our main contribution is an algorithm, inspired by sequential Monte Carlo samplers, that approximates this maximum distribution. Subsequently, by taking samples from this distribution, we enable Thompson sampling to be applied to (armed-bandit) optimization problems with a continuous input space. All this is done without requiring the optimization of a nonlinear acquisition function. Experiments have shown that the resulting optimization method has a competitive performance at keeping the cumulative regret limited.
LGMar 31, 2016
Online Optimization with Costly and Noisy Measurements using Random Fourier ExpansionsLaurens Bliek, Hans R. G. W. Verstraete, Michel Verhaegen et al.
This paper analyzes DONE, an online optimization algorithm that iteratively minimizes an unknown function based on costly and noisy measurements. The algorithm maintains a surrogate of the unknown function in the form of a random Fourier expansion (RFE). The surrogate is updated whenever a new measurement is available, and then used to determine the next measurement point. The algorithm is comparable to Bayesian optimization algorithms, but its computational complexity per iteration does not depend on the number of measurements. We derive several theoretical results that provide insight on how the hyper-parameters of the algorithm should be chosen. The algorithm is compared to a Bayesian optimization algorithm for a benchmark problem and three applications, namely, optical coherence tomography, optical beam-forming network tuning, and robot arm control. It is found that the DONE algorithm is significantly faster than Bayesian optimization in the discussed problems, while achieving a similar or better performance.
MLJan 29, 2016
System Identification through Online Sparse Gaussian Process Regression with Input NoiseHildo Bijl, Thomas B. Schön, Jan-Willem van Wingerden et al.
There has been a growing interest in using non-parametric regression methods like Gaussian Process (GP) regression for system identification. GP regression does traditionally have three important downsides: (1) it is computationally intensive, (2) it cannot efficiently implement newly obtained measurements online, and (3) it cannot deal with stochastic (noisy) input points. In this paper we present an algorithm tackling all these three issues simultaneously. The resulting Sparse Online Noisy Input GP (SONIG) regression algorithm can incorporate new noisy measurements in constant runtime. A comparison has shown that it is more accurate than similar existing regression algorithms. When applied to non-linear black-box system modeling, its performance is competitive with existing non-linear ARX models.
ITJan 29, 2013
Quadratic Basis PursuitHenrik Ohlsson, Allen Y. Yang, Roy Dong et al.
In many compressive sensing problems today, the relationship between the measurements and the unknowns could be nonlinear. Traditional treatment of such nonlinear relationships have been to approximate the nonlinearity via a linear model and the subsequent un-modeled dynamics as noise. The ability to more accurately characterize nonlinear models has the potential to improve the results in both existing compressive sensing applications and those where a linear approximation does not suffice, e.g., phase retrieval. In this paper, we extend the classical compressive sensing framework to a second-order Taylor expansion of the nonlinearity. Using a lifting technique and a method we call quadratic basis pursuit, we show that the sparse signal can be recovered exactly when the sampling rate is sufficiently high. We further present efficient numerical algorithms to recover sparse signals in second-order nonlinear systems, which are considerably more difficult to solve than their linear counterparts in sparse optimization.