Maarten Schoukens

SY
h-index16
36papers
744citations
Novelty43%
AI Score53

36 Papers

SYOct 24, 2016
Filter-based regularisation for impulse response modelling

Anna Marconato, Maarten Schoukens, Johan Schoukens

In the last years, the success of kernel-based regularisation techniques in solving impulse response modelling tasks has revived the interest on linear system identification. In this work, an alternative perspective on the same problem is introduced. Instead of relying on a Bayesian framework to include assumptions about the system in the definition of the covariance matrix of the parameters, here the prior knowledge is injected at the cost function level. The key idea is to define the regularisation matrix as a filtering operation on the parameters, which allows for a more intuitive formulation of the problem from an engineering point of view. Moreover, this results in a unified framework to model low-pass, band-pass and high-pass systems, and systems with one or more resonances. The proposed filter-based approach outperforms the existing regularisation method based on the TC and DC kernels, as illustrated by means of Monte Carlo simulations on several linear modelling examples.

SYApr 25, 2018
Structure Discrimination in Block-Oriented Models Using Linear Approximations: A Theoretic Framework

Johan Schoukens, Rik Pintelon, Yves Rolain et al.

In this paper we show that it is possible to retrieve structural information about complex block-oriented nonlinear systems, starting from linear approximations of the nonlinear system around different setpoints.The key idea is to monitor the movements of the poles and zeros of the linearized models and to reduce the number of candidate models on the basis of these observations. Besides the well known open loop single branch Wiener-, Hammerstein-, and Wiener-Hammerstein systems, we also cover a number of more general structures like parallel (multi branch) Wiener-Hammerstein models, and closed loop block oriented models, including linear fractional representation (LFR) models.

SYDec 14, 2016
Initial estimates for Wiener-Hammerstein models using phase-coupled multisines

Koen Tiels, Maarten Schoukens, Johan Schoukens

Block-oriented models are often used to model nonlinear systems. These models consist of linear dynamic (L) and nonlinear static (N) sub-blocks. This paper addresses the generation of initial estimates for a Wiener-Hammerstein model (LNL cascade). While it is easy to measure the product of the two linear blocks using a Gaussian excitation and linear identification methods, it is difficult to split the global dynamics over the individual blocks. This paper first proposes a well-designed multisine excitation with pairwise coupled random phases. Next, a modified best linear approximation is estimated on a shifted frequency grid. It is shown that this procedure creates a shift of the input dynamics with a known frequency offset, while the output dynamics do not shift. The resulting transfer function, which has complex coefficients due to the frequency shift, is estimated with a modified frequency domain estimation method. The identified poles and zeros can be assigned to either the input or output dynamics. Once this is done, it is shown in the literature that the remaining initialization problem can be solved much easier than the original one. The method is illustrated on experimental data obtained from the Wiener-Hammerstein benchmark system.

SYApr 26, 2018
Structure detection of Wiener-Hammerstein systems with process noise

Erliang Zhang, Maarten Schoukens, Johan Schoukens

Identification of nonlinear block-oriented models has been extensively studied. The presence of the process noise, more precisely its location in the block-oriented model influences essentially the development of a consistent identification algorithm. The present work is proposed with the aim to localize the process noise in the block-oriented model for accurate nonlinear modeling. To this end, the response of a Wiener-Hammerstein system is theoretically analyzed, the disturbance component in the output, caused by the process noise preceding the static nonlinearity, is shown to be dependent on the input signal. Inspired by such theoretical observation, a simple and new protocol is developed to determine the location of the process noise with respect to the static nonlinearity by using an input signal that is periodic, but nonstationary within one period. In addition, the proposed technique is promising to detect the type of certain static nonlinearity (e.g., dead-zone, saturation). Finally, it is validated on a simulated example and a real-life benchmark.

SYApr 20, 2018
Extending the Best Linear Approximation Framework to the Process Noise Case

Maarten Schoukens, Rik Pintelon, Tadeusz P. Dobrowiecki et al.

The Best Linear Approximation (BLA) framework has already proven to be a valuable tool to analyze nonlinear systems and to start the nonlinear modeling process. The existing BLA framework is limited to systems with additive (colored) noise at the output. Such a noise framework is a simplified representation of reality. Process noise can play an important role in many real-life applications. This paper generalizes the Best Linear Approximation framework to account also for the process noise, both for the open-loop and the closed-loop setting, and shows that the most important properties of the existing BLA framework remain valid. The impact of the process noise contributions on the robust BLA estimation method is also analyzed.

SYJan 9, 2018
Identification of block-oriented nonlinear systems starting from linear approximations: A survey

Maarten Schoukens, Koen Tiels

Block-oriented nonlinear models are popular in nonlinear system identification because of their advantages of being simple to understand and easy to use. Many different identification approaches were developed over the years to estimate the parameters of a wide range of block-oriented nonlinear models. One class of these approaches uses linear approximations to initialize the identification algorithm. The best linear approximation framework and the $ε$-approximation framework, or equivalent frameworks, allow the user to extract important information about the system, guide the user in selecting good candidate model structures and orders, and prove to be a good starting point for nonlinear system identification algorithms. This paper gives an overview of the different block-oriented nonlinear models that can be identified using linear approximations, and of the identification algorithms that have been developed in the past. A non-exhaustive overview of the most important other block-oriented nonlinear system identification approaches is also provided throughout this paper.

SYApr 19, 2018
Comparison of several data-driven nonlinear system identification methods on a simplified glucoregulatory system example

Anna Marconato, Maarten Schoukens, Koen Tiels et al.

In this paper, several advanced data-driven nonlinear identification techniques are compared on a specific problem: a simplified glucoregulatory system modeling example. This problem represents a challenge in the development of an artificial pancreas for T1DM treatment, since for this application good nonlinear models are needed to design accurate closed-loop controllers to regulate the glucose level in the blood. Block-oriented as well as state-space models are used to describe both the dynamics and the nonlinear behavior of the insulin-glucose system, and the advantages and drawbacks of each method are pointed out. The obtained nonlinear models are accurate in simulating the patient's behavior, and some of them are also sufficiently simple to be considered in the implementation of a model-based controller to develop the artificial pancreas.

49.0SYMay 10
Learning Koopman Models From Data Under General Noise Conditions

Lucian Cristian Iacob, Máté Szécsi, Gerben Izaak Beintema et al.

This paper presents a novel identification approach of Koopman models of nonlinear systems with inputs under rather general noise conditions. The method uses deep state-space encoders based on the concept of state reconstructability and an efficient multiple-shooting formulation of the squared loss of the prediction error to estimate the dynamics and the lifted state only from input-output data. Furthermore, the Koopman model structure includes an innovation noise term that is used to handle process and measurement noise. It is shown that the proposed approach is statistically consistent (estimation error tends to zero when the number of data points goes to infinity) and computationally efficient due to the multiple-shooting formulation, by which the prediction error of the model can be calculated on multiple subsections of the data in parallel. The latter allows for efficient batch optimization of the network parameters and, at the same time, excellent long-term prediction capabilities of the obtained models. The performance of the approach is illustrated by nonlinear benchmark examples and experimental data from a Crazyflie 2.1 quadcopter.

SYNov 26, 2018
Grammar-based Representation and Identification of Dynamical Systems

Dhruv Khandelwal, Maarten Schoukens, Roland Tóth

In this paper we propose a novel approach to identify dynamical systems. The method estimates the model structure and the parameters of the model simultaneously, automating the critical decisions involved in identification such as model structure and complexity selection. In order to solve the combined model structure and model parameter estimation problem, a new representation of dynamical systems is proposed. The proposed representation is based on Tree Adjoining Grammar, a formalism that was developed from linguistic considerations. Using the proposed representation, the identification problem can be interpreted as a multi-objective optimization problem and we propose a Evolutionary Algorithm-based approach to solve the problem. A benchmark example is used to demonstrate the proposed approach. The results were found to be comparable to that obtained by state-of-the-art non-linear system identification methods, without making use of knowledge of the system description.

SYJun 28, 2016
Obtaining the Pre-Inverse of a Power Amplifier using Iterative Learning Control

Maarten Schoukens, Jules Hammenecker, Adam Cooman

Telecommunication networks make extensive use of power amplifiers to broaden the coverage from transmitter to receiver. Achieving high power efficiency is challenging and comes at a price: the wanted linear performance is degraded due to nonlinear effects. To compensate for these nonlinear disturbances, existing techniques compute the pre-inverse of the power amplifier by estimation of a nonlinear model. However, the extraction of this nonlinear model is involved and requires advanced system identification techniques. We used the plant inversion iterative learning control algorithm to investigate whether the nonlinear modeling step can be simplified. This paper introduces the iterative learning control framework for the pre-inverse estimation and predistortion of power amplifiers. The iterative learning control algorithm is used to obtain a high quality predistorted input for the power amplifier under study without requiring a nonlinear model of the power amplifier. In a second step a nonlinear pre-inverse model of the amplifier is obtained. Both the nonlinear and memory effects of a power amplifier can be compensated by this approach. The convergence of the iterative approach, and the predistortion results are illustrated on a simulation of a Motorola LDMOS transistor based power amplifier and a measurement example using the Chalmers RF WebLab measurement setup.

28.5SYMay 26
Efficient stochastic model-predictive control based on the meta-state-space representation

Bendegúz Györök, Roland Tóth, Maarten Schoukens et al.

Stochastic model-predictive control (SMPC) has evolved to a powerful framework for the control of stochastic dynamical systems. SMPC utilizes a probabilistic uncertainty description to provide a systematic trade-off between the control objective and constraint satisfaction in a statistical sense. However, the majority of existing SMPC methods face challenges related to computational tractability due to the need for stochastic inference. Approaches that apply accurate inference are computationally demanding, which can lead to serious limitations in the implementability of these methods. Hence, in practice, the uncertainty propagation and the resulting distributions are typically approximated, e.g., by Gaussian distributions. These approximations promote computational efficiency, but are often too conservative, becoming a limiting factor in the representation of stochastic state evolution and the implied guarantees. To overcome this fundamental limitation of SMPC approaches, we propose a novel formulation based on the meta-state-space (MSS) representation of stochastic dynamical systems. The proposed MSS-based SMPC scheme offers a computationally efficient way to forward propagate the uncertainty with a flexible and highly accurate approximation of the probabilistic system description. With the presented method, the entire output probability density function can be directly shaped, which is unprecedented among existing SMPC techniques. Finally, we provide a detailed theoretical analysis and demonstrate the effectiveness of the proposed methodology via an extensive simulation study.

SYOct 26, 2022
Deep Subspace Encoders for Nonlinear System Identification

Gerben I. Beintema, Maarten Schoukens, Roland Tóth

Using Artificial Neural Networks (ANN) for nonlinear system identification has proven to be a promising approach, but despite of all recent research efforts, many practical and theoretical problems still remain open. Specifically, noise handling and models, issues of consistency and reliable estimation under minimisation of the prediction error are the most severe problems. The latter comes with numerous practical challenges such as explosion of the computational cost in terms of the number of data samples and the occurrence of instabilities during optimization. In this paper, we aim to overcome these issues by proposing a method which uses a truncated prediction loss and a subspace encoder for state estimation. The truncated prediction loss is computed by selecting multiple truncated subsections from the time series and computing the average prediction loss. To obtain a computationally efficient estimation method that minimizes the truncated prediction loss, a subspace encoder represented by an artificial neural network is introduced. This encoder aims to approximate the state reconstructability map of the estimated model to provide an initial state for each truncated subsection given past inputs and outputs. By theoretical analysis, we show that, under mild conditions, the proposed method is locally consistent, increases optimization stability, and achieves increased data efficiency by allowing for overlap between the subsections. Lastly, we provide practical insights and user guidelines employing a numerical example and state-of-the-art benchmark results.

LGApr 20, 2022
Continuous-time identification of dynamic state-space models by deep subspace encoding

Gerben I. Beintema, Maarten Schoukens, Roland Tóth

Continuous-time (CT) modeling has proven to provide improved sample efficiency and interpretability in learning the dynamical behavior of physical systems compared to discrete-time (DT) models. However, even with numerous recent developments, the CT nonlinear state-space (NL-SS) model identification problem remains to be solved in full, considering common experimental aspects such as the presence of external inputs, measurement noise, latent states, and general robustness. This paper presents a novel estimation method that addresses all these aspects and that can obtain state-of-the-art results on multiple benchmarks with compact fully connected neural networks capturing the CT dynamics. The proposed estimation method called the subspace encoder approach (SUBNET) ascertains these results by efficiently approximating the complete simulation loss by evaluating short simulations on subsections of the data, by using an encoder function to estimate the initial state for each subsection and a novel state-derivative normalization to ensure stability and good numerical conditioning of the training process. We prove that the use of subsections increases cost function smoothness together with the necessary requirements for the existence of the encoder function and we show that the proposed state-derivative normalization is essential for reliable estimation of CT NL-SS models.

SYNov 27, 2022
A Data-driven Pricing Scheme for Optimal Routing through Artificial Currencies

David van de Sanden, Maarten Schoukens, Mauro Salazar

Mobility systems often suffer from a high price of anarchy due to the uncontrolled behavior of selfish users. This may result in societal costs that are significantly higher compared to what could be achieved by a centralized system-optimal controller. Monetary tolling schemes can effectively align the behavior of selfish users with the system-optimum. Yet, they inevitably discriminate the population in terms of income. Artificial currencies were recently presented as an effective alternative that can achieve the same performance, whilst guaranteeing fairness among the population. However, those studies were based on behavioral models that may differ from practical implementations. This paper presents a data-driven approach to automatically adapt artificial-currency tolls within repetitive-game settings. We first consider a parallel-arc setting whereby users commute on a daily basis from an individual origin to an individual destination, choosing a route in exchange of an artificial-currency price or reward, while accounting for the impact of the choices of the other users on travel discomfort. Second, we devise a model-based reinforcement learning controller that autonomously learns the optimal pricing policy by interacting with the proposed framework considering the closeness of the observed aggregate flows to a desired system-optimal distribution as a reward function. Our numerical results show that the proposed data-driven pricing scheme can effectively align the users' flows with the system optimum, significantly reducing the societal costs with respect to the uncontrolled flows (by about 15% and 25% depending on the scenario), and respond to environmental changes in a robust and efficient manner.

SYApr 4, 2023
Initialization Approach for Nonlinear State-Space Identification via the Subspace Encoder Approach

Rishi Ramkannan, Gerben I. Beintema, Roland Tóth et al.

The SUBNET neural network architecture has been developed to identify nonlinear state-space models from input-output data. To achieve this, it combines the rolled-out nonlinear state-space equations and a state encoder function, both parameterised as neural networks The encoder function is introduced to reconstruct the current state from past input-output data. Hence, it enables the forward simulation of the rolled-out state-space model. While this approach has shown to provide high-accuracy and consistent model estimation, its convergence can be significantly improved by efficient initialization of the training process. This paper focuses on such an initialisation of the subspace encoder approach using the Best Linear Approximation (BLA). Using the BLA provided state-space matrices and its associated reconstructability map, both the state-transition part of the network and the encoder are initialized. The performance of the improved initialisation scheme is evaluated on a Wiener-Hammerstein simulation example and a benchmark dataset. The results show that for a weakly nonlinear system, the proposed initialisation based on the linear reconstructability map results in a faster convergence and a better model quality.

MLApr 2, 2022
Variational message passing for online polynomial NARMAX identification

Wouter Kouw, Albert Podusenko, Magnus Koudahl et al.

We propose a variational Bayesian inference procedure for online nonlinear system identification. For each output observation, a set of parameter posterior distributions is updated, which is then used to form a posterior predictive distribution for future outputs. We focus on the class of polynomial NARMAX models, which we cast into probabilistic form and represent in terms of a Forney-style factor graph. Inference in this graph is efficiently performed by a variational message passing algorithm. We show empirically that our variational Bayesian estimator outperforms an online recursive least-squares estimator, most notably in small sample size settings and low noise regimes, and performs on par with an iterative least-squares estimator trained offline.

71.0SYApr 13
Data-driven augmentation of first-principles models under constraint-free well-posedness and stability guarantees

Bendegúz Györök, Roel Drenth, Chris Verhoek et al.

The integration of first-principles models with learning-based components, i.e., model augmentation, has gained increasing attention, as it offers higher model accuracy and faster convergence properties compared to black-box approaches, while generating physically interpretable models. Recently, a unified formulation has been proposed that generalizes existing model augmentation structures, utilizing linear fractional representations (LFRs). However, several potential benefits of the approach remain underexplored. In this work, we address three key limitations. First, the added flexibility of LFRs also introduces possible algebraic loops, i.e., a problem of well-posedness. To address this challenge, we propose a constraint-free direct parametrization of the model structure with a well-posedness guarantee. Second, we introduce a constraint-free parametrization that ensures stability of the overall model augmentation structure via contraction. Third, we adopt an efficient identification pipeline capable of handling non-smooth cost functions, such as group-lasso regularization, which facilitates automatic model order selection and discovery of the required augmentation configuration. These contributions are demonstrated on various simulation and benchmark identification examples.

21.8SYMay 12
Efficient Learning of Affine and Rational Dependency LPV Models With Linear Fractional Representation

Roel Drenth, Jan H. Hoekstra, Maarten Schoukens et al.

Identifying control-friendly models of nonlinear systems remains one of the major challenges at the intersection of system identification and control. The Linear Parameter-Varying (LPV) framework offers a promising solution, but existing identification methods often rely on model structures with affine scheduling dependency. Instead, this work proposes the use of LPV models with Linear Fractional Representation (LFR) admitting a rational scheduling-dependency, capable of modelling complex nonlinear systems with fewer scheduling variables compared to affine models. This work introduces a direct parameterization to ensure well-posedness of rational LPV-LFR models, which by joint-estimation of an LPV plant and scheduling map, using only input-output data, is capable of modelling complex nonlinear systems. Accuracy of the proposed approach is shown on two simulation examples.

49.5SYMay 4
Least Costly Space-Filling Experiment Design for the Identification of a Nonlinear System

Máté Kiss, Maarten Schoukens, Roland Tóth

The quality of an estimated nonlinear model highly depends on the data quality that was used for the system identification. By using a Gaussian Process-based optimal input design approach, a so-called space-filling dataset can be generated in the feature space of the system model. The design method is applicable for a broad type of signals and models and also incorporates information measures through optimality criteria into the signal design. However, the resulting input design can be costly to apply to the real system. The goal of this paper is to propose a space-filling input design that can minimize the experimentation cost in terms of a user defined measure, while still guaranteeing a prescribed level of space-fillingness. Through a Monte Carlo simulation study we demonstrate that the proposed method can appropriately shape the excitation signal to significantly reduce the experimental cost while the identified model performance remains adequate.

OCMay 2, 2024
Koopman Data-Driven Predictive Control with Robust Stability and Recursive Feasibility Guarantees

Thomas de Jong, Valentina Breschi, Maarten Schoukens et al.

In this paper, we consider the design of data-driven predictive controllers for nonlinear systems from input-output data via linear-in-control input Koopman lifted models. Instead of identifying and simulating a Koopman model to predict future outputs, we design a subspace predictive controller in the Koopman space. This allows us to learn the observables minimizing the multi-step output prediction error of the Koopman subspace predictor, preventing the propagation of prediction errors. To avoid losing feasibility of our predictive control scheme due to prediction errors, we compute a terminal cost and terminal set in the Koopman space and we obtain recursive feasibility guarantees through an interpolated initial state. As a third contribution, we introduce a novel regularization cost yielding input-to-state stability guarantees with respect to the prediction error for the resulting closed-loop system. The performance of the developed Koopman data-driven predictive control methodology is illustrated on a nonlinear benchmark example from the literature.

SYMay 17, 2024
Baseline Results for Selected Nonlinear System Identification Benchmarks

Max D. Champneys, Gerben I. Beintema, Roland Tóth et al.

Nonlinear system identification remains an important open challenge across research and academia. Large numbers of novel approaches are seen published each year, each presenting improvements or extensions to existing methods. It is natural, therefore, to consider how one might choose between these competing models. Benchmark datasets provide one clear way to approach this question. However, to make meaningful inference based on benchmark performance it is important to understand how well a new method performs comparatively to results available with well-established methods. This paper presents a set of ten baseline techniques and their relative performances on five popular benchmarks. The aim of this contribution is to stimulate thought and discussion regarding objective comparison of identification methodologies.

LGFeb 20, 2025
Port-Hamiltonian Neural Networks with Output Error Noise Models

Sarvin Moradi, Gerben I. Beintema, Nick Jaensson et al.

Hamiltonian neural networks (HNNs) represent a promising class of physics-informed deep learning methods that utilize Hamiltonian theory as foundational knowledge within neural networks. However, their direct application to engineering systems is often challenged by practical issues, including the presence of external inputs, dissipation, and noisy measurements. This paper introduces a novel framework that enhances the capabilities of HNNs to address these real-life factors. We integrate port-Hamiltonian theory into the neural network structure, allowing for the inclusion of external inputs and dissipation, while mitigating the impact of measurement noise through an output-error (OE) model structure. The resulting output error port-Hamiltonian neural networks (OE-pHNNs) can be adapted to tackle modeling complex engineering systems with noisy measurements. Furthermore, we propose the identification of OE-pHNNs based on the subspace encoder approach (SUBNET), which efficiently approximates the complete simulation loss using subsections of the data and uses an encoder function to predict initial states. By integrating SUBNET with OE-pHNNs, we achieve consistent models of complex engineering systems under noisy measurements. In addition, we perform a consistency analysis to ensure the reliability of the proposed data-driven model learning method. We demonstrate the effectiveness of our approach on system identification benchmarks, showing its potential as a powerful tool for modeling dynamic systems in real-world applications.

SYJan 5, 2024
State Derivative Normalization for Continuous-Time Deep Neural Networks

Jonas Weigand, Gerben I. Beintema, Jonas Ulmen et al.

The importance of proper data normalization for deep neural networks is well known. However, in continuous-time state-space model estimation, it has been observed that improper normalization of either the hidden state or hidden state derivative of the model estimate, or even of the time interval can lead to numerical and optimization challenges with deep learning based methods. This results in a reduced model quality. In this contribution, we show that these three normalization tasks are inherently coupled. Due to the existence of this coupling, we propose a solution to all three normalization challenges by introducing a normalization constant at the state derivative level. We show that the appropriate choice of the normalization constant is related to the dynamics of the to-be-identified system and we derive multiple methods of obtaining an effective normalization constant. We compare and discuss all the normalization strategies on a benchmark problem based on experimental data from a cascaded tanks system and compare our results with other methods of the identification literature.

LGJan 10, 2025
Orthogonal projection-based regularization for efficient model augmentation

Bendegúz M. Györök, Jan H. Hoekstra, Johan Kon et al.

Deep-learning-based nonlinear system identification has shown the ability to produce reliable and highly accurate models in practice. However, these black-box models lack physical interpretability, and a considerable part of the learning effort is often spent on capturing already expected/known behavior of the system, that can be accurately described by first-principles laws of physics. A potential solution is to directly integrate such prior physical knowledge into the model structure, combining the strengths of physics-based modeling and deep-learning-based identification. The most common approach is to use an additive model augmentation structure, where the physics-based and the machine-learning (ML) components are connected in parallel, i.e., additively. However, such models are overparametrized, training them is challenging, potentially causing the physics-based part to lose interpretability. To overcome this challenge, this paper proposes an orthogonal projection-based regularization technique to enhance parameter learning and even model accuracy in learning-based augmentation of nonlinear baseline models.

SYMay 2, 2023
Physics-Informed Learning Using Hamiltonian Neural Networks with Output Error Noise Models

Sarvin Moradi, Nick Jaensson, Roland Tóth et al.

In order to make data-driven models of physical systems interpretable and reliable, it is essential to include prior physical knowledge in the modeling framework. Hamiltonian Neural Networks (HNNs) implement Hamiltonian theory in deep learning and form a comprehensive framework for modeling autonomous energy-conservative systems. Despite being suitable to estimate a wide range of physical system behavior from data, classical HNNs are restricted to systems without inputs and require noiseless state measurements and information on the derivative of the state to be available. To address these challenges, this paper introduces an Output Error Hamiltonian Neural Network (OE-HNN) modeling approach to address the modeling of physical systems with inputs and noisy state measurements. Furthermore, it does not require the state derivatives to be known. Instead, the OE-HNN utilizes an ODE-solver embedded in the training process, which enables the OE-HNN to learn the dynamics from noisy state measurements. In addition, extending HNNs based on the generalized Hamiltonian theory enables to include external inputs into the framework which are important for engineering applications. We demonstrate via simulation examples that the proposed OE-HNNs results in superior modeling performance compared to classical HNNs.

SYOct 6, 2021
Deep Identification of Nonlinear Systems in Koopman Form

Lucian Cristian Iacob, Gerben Izaak Beintema, Maarten Schoukens et al.

The present paper treats the identification of nonlinear dynamical systems using Koopman-based deep state-space encoders. Through this method, the usual drawback of needing to choose a dictionary of lifting functions a priori is circumvented. The encoder represents the lifting function to the space where the dynamics are linearly propagated using the Koopman operator. An input-affine formulation is considered for the lifted model structure and we address both full and partial state availability. The approach is implemented using the the deepSI toolbox in Python. To lower the computational need of the simulation error-based training, the data is split into subsections where multi-step prediction errors are calculated independently. This formulation allows for efficient batch optimization of the network parameters and, at the same time, excellent long term prediction capabilities of the obtained models. The performance of the approach is illustrated by nonlinear benchmark examples.

LGMar 26, 2021
Improved Initialization of State-Space Artificial Neural Networks

Maarten Schoukens

The identification of black-box nonlinear state-space models requires a flexible representation of the state and output equation. Artificial neural networks have proven to provide such a representation. However, as in many identification problems, a nonlinear optimization problem needs to be solved to obtain the model parameters (layer weights and biases). A well-thought initialization of these model parameters can often avoid that the nonlinear optimization algorithm converges to a poorly performing local minimum of the considered cost function. This paper introduces an improved initialization approach for nonlinear state-space models represented as a recurrent artificial neural network and emphasizes the importance of including an explicit linear term in the model structure. Some of the neural network weights are initialized starting from a linear approximation of the nonlinear system, while others are initialized using random values or zeros. The effectiveness of the proposed initialization approach over previously proposed methods is illustrated on two benchmark examples.

SYDec 14, 2020
Non-linear State-space Model Identification from Video Data using Deep Encoders

Gerben Izaak Beintema, Roland Toth, Maarten Schoukens

Identifying systems with high-dimensional inputs and outputs, such as systems measured by video streams, is a challenging problem with numerous applications in robotics, autonomous vehicles and medical imaging. In this paper, we propose a novel non-linear state-space identification method starting from high-dimensional input and output data. Multiple computational and conceptual advances are combined to handle the high-dimensional nature of the data. An encoder function, represented by a neural network, is introduced to learn a reconstructability map to estimate the model states from past inputs and outputs. This encoder function is jointly learned with the dynamics. Furthermore, multiple computational improvements, such as an improved reformulation of multiple shooting and batch optimization, are proposed to keep the computational time under control when dealing with high-dimensional and large datasets. We apply the proposed method to a video stream of a simulated environment of a controllable ball in a unit box. The study shows low simulation error with excellent long term prediction capability of the model obtained using the proposed method.

LGDec 14, 2020
Nonlinear state-space identification using deep encoder networks

Gerben Beintema, Roland Toth, Maarten Schoukens

Nonlinear state-space identification for dynamical systems is most often performed by minimizing the simulation error to reduce the effect of model errors. This optimization problem becomes computationally expensive for large datasets. Moreover, the problem is also strongly non-convex, often leading to sub-optimal parameter estimates. This paper introduces a method that approximates the simulation loss by splitting the data set into multiple independent sections similar to the multiple shooting method. This splitting operation allows for the use of stochastic gradient optimization methods which scale well with data set size and has a smoothing effect on the non-convex cost function. The main contribution of this paper is the introduction of an encoder function to estimate the initial state at the start of each section. The encoder function estimates the initial states using a feed-forward neural network starting from historical input and output samples. The efficiency and performance of the proposed state-space encoder method is illustrated on two well-known benchmarks where, for instance, the method achieves the lowest known simulation error on the Wiener--Hammerstein benchmark.

NCDec 14, 2020
System identification of biophysical neuronal models

Thiago B. Burghi, Maarten Schoukens, Rodolphe Sepulchre

After sixty years of quantitative biophysical modeling of neurons, the identification of neuronal dynamics from input-output data remains a challenging problem, primarily due to the inherently nonlinear nature of excitable behaviors. By reformulating the problem in terms of the identification of an operator with fading memory, we explore a simple approach based on a parametrization given by a series interconnection of Generalized Orthonormal Basis Functions (GOBFs) and static Artificial Neural Networks. We show that GOBFs are particularly well-suited to tackle the identification problem, and provide a heuristic for selecting GOBF poles which addresses the ultra-sensitivity of neuronal behaviors. The method is illustrated on the identification of a bursting model from the crab stomatogastric ganglion.

SYJan 15, 2020
A Tree Adjoining Grammar Representation for Models Of Stochastic Dynamical Systems

Dhruv Khandelwal, Maarten Schoukens, Roland Tóth

Model structure and complexity selection remains a challenging problem in system identification, especially for parametric non-linear models. Many Evolutionary Algorithm (EA) based methods have been proposed in the literature for estimating model structure and complexity. In most cases, the proposed methods are devised for estimating structure and complexity within a specified model class and hence these methods do not extend to other model structures without significant changes. In this paper, we propose a Tree Adjoining Grammar (TAG) for stochastic parametric models. TAGs can be used to generate models in an EA framework while imposing desirable structural constraints and incorporating prior knowledge. In this paper, we propose a TAG that can systematically generate models ranging from FIRs to polynomial NARMAX models. Furthermore, we demonstrate that TAGs can be easily extended to more general model classes, such as the non-linear Box-Jenkins model class, enabling the realization of flexible and automatic model structure and complexity selection via EA.

SYApr 5, 2019
Data-driven Modelling of Dynamical Systems Using Tree Adjoining Grammar and Genetic Programming

Dhruv Khandelwal, Maarten Schoukens, Roland Tóth

State-of-the-art methods for data-driven modelling of non-linear dynamical systems typically involve interactions with an expert user. In order to partially automate the process of modelling physical systems from data, many EA-based approaches have been proposed for model-structure selection, with special focus on non-linear systems. Recently, an approach for data-driven modelling of non-linear dynamical systems using Genetic Programming (GP) was proposed. The novelty of the method was the modelling of noise and the use of Tree Adjoining Grammar to shape the search-space explored by GP. In this paper, we report results achieved by the proposed method on three case studies. Each of the case studies considered here is based on real physical systems. The case studies pose a variety of challenges. In particular, these challenges range over varying amounts of prior knowledge of the true system, amount of data available, the complexity of the dynamics of the system, and the nature of non-linearities in the system. Based on the results achieved for the case studies, we critically analyse the performance of the proposed method.

SYOct 16, 2018
On the Simulation of Polynomial NARMAX Models

Dhruv Khandelwal, Maarten Schoukens, Roland Tóth

In this paper, we show that the common approach for simulation non-linear stochastic models, commonly used in system identification, via setting the noise contributions to zero results in a biased response. We also demonstrate that to achieve unbiased simulation of finite order NARMAX models, in general, we require infinite order simulation models. The main contributions of the paper are two-fold. Firstly, an alternate representation of polynomial NARMAX models, based on Hermite polynomials, is proposed. The proposed representation provides a convenient way to translate a polynomial NARMAX model to a corresponding simulation model by simply setting certain terms to zero. This translation is exact when the simulation model can be written as an NFIR model. Secondly, a parameterized approximation method is proposed to curtail infinite order simulation models to a finite order. The proposed approximation can be viewed as a trade-off between the conventional approach of setting noise contributions to zero and the approach of incorporating the bias introduced by higher-order moments of the noise distribution. Simulation studies are provided to illustrate the utility of the proposed representation and approximation method.

SYSep 13, 2018
From Nonlinear Identification to Linear Parameter Varying Models: Benchmark Examples

Maarten Schoukens, Roland Tóth

Linear parameter-varying (LPV) models form a powerful model class to analyze and control a (nonlinear) system of interest. Identifying a LPV model of a nonlinear system can be challenging due to the difficulty of selecting the scheduling variable(s) a priori, which is quite challenging in case a first principles based understanding of the system is unavailable. This paper presents a systematic LPV embedding approach starting from nonlinear fractional representation models. A nonlinear system is identified first using a nonlinear block-oriented linear fractional representation (LFR) model. This nonlinear LFR model class is embedded into the LPV model class by factorization of the static nonlinear block present in the model. As a result of the factorization a LPV-LFR or a LPV state-space model with an affine dependency results. This approach facilitates the selection of the scheduling variable from a data-driven perspective. Furthermore the estimation is not affected by measurement noise on the scheduling variables, which is often left untreated by LPV model identification methods. The proposed approach is illustrated on two well-established nonlinear modeling benchmark examples.

SYSep 13, 2018
Linear Parameter Varying Representation of a class of MIMO Nonlinear Systems

Maarten Schoukens, Roland Tóth

Linear parameter-varying (LPV) models form a powerful model class to analyze and control a (nonlinear) system of interest. Identifying an LPV model of a nonlinear system can be challenging due to the difficulty of selecting the scheduling variable(s) a priori, especially if a first principles based understanding of the system is unavailable. Converting a nonlinear model to an LPV form is also non-trivial and requires systematic methods to automate the process. Inspired by these challenges, a systematic LPV embedding approach starting from multiple-input multiple-output (MIMO) linear fractional representations with a nonlinear feedback block (NLFR) is proposed. This NLFR model class is embedded into the LPV model class by an automated factorization of the (possibly MIMO) static nonlinear block present in the model. As a result of the factorization, an LPV-LFR or an LPV state-space model with affine dependency on the scheduling is obtained. This approach facilitates the selection of the scheduling variable and the connected mapping of system variables. Such a conversion method enables to use nonlinear identification tools to estimate LPV models. The potential of the proposed approach is illustrated on a 2-DOF nonlinear mass-spring-damper example.

SYAug 22, 2017
Parametric identification of parallel Wiener-Hammerstein systems

Maarten Schoukens, Anna Marconato, Rik Pintelon et al.

Block-oriented nonlinear models are popular in nonlinear modeling because of their advantages to be quite simple to understand and easy to use. To increase the flexibility of single branch block-oriented models, such as Hammerstein, Wiener, and Wiener-Hammerstein models, parallel block-oriented models can be considered. This paper presents a method to identify parallel Wiener-Hammerstein systems starting from input-output data only. In the first step, the best linear approximation is estimated for different input excitation levels. In the second step, the dynamics are decomposed over a number of parallel orthogonal branches. Next, the dynamics of each branch are partitioned into a linear time invariant subsystem at the input and a linear time invariant subsystem at the output. This is repeated for each branch of the model. The static nonlinear part of the model is also estimated during this step. The consistency of the proposed initialization procedure is proven. The method is validated on real-world measurements using a custom built parallel Wiener-Hammerstein test system.