SYLGJan 2, 2023

Sparse neural networks with skip-connections for identification of aluminum electrolysis cell

arXiv:2301.00582v22 citationsh-index: 45
Originality Incremental advance
AI Analysis

This work addresses the challenge of accurate and stable system identification for aluminum electrolysis, a costly industrial process with infrequent data, though it is incremental as it builds on existing neural network techniques.

The paper tackled the problem of unstable long-term predictions in neural networks for aluminum electrolysis cell identification by combining concatenated skip-connections and ℓ1 regularization, resulting in improved open-loop stability and predictive accuracy across all dataset sizes and prediction horizons compared to standard methods.

Neural networks are rapidly gaining interest in nonlinear system identification due to the model's ability to capture complex input-output relations directly from data. However, despite the flexibility of the approach, there are still concerns about the safety of these models in this context, as well as the need for large amounts of potentially expensive data. Aluminum electrolysis is a highly nonlinear production process, and most of the data must be sampled manually, making the sampling process expensive and infrequent. In the case of infrequent measurements of state variables, the accuracy and open-loop stability of the long-term predictions become highly important. Standard neural networks struggle to provide stable long-term predictions with limited training data. In this work, we investigate the effect of combining concatenated skip-connections and the sparsity-promoting $\ell_1$ regularization on the open-loop stability and accuracy of forecasts with short, medium, and long prediction horizons. The case study is conducted on a high-dimensional and nonlinear simulator representing an aluminum electrolysis cell's mass and energy balance. The proposed model structure contains concatenated skip connections from the input layer and all intermittent layers to the output layer, referred to as InputSkip. $\ell_1$ regularized InputSkip is called sparse InputSkip. The results show that sparse InputSkip outperforms dense and sparse standard feedforward neural networks and dense InputSkip regarding open-loop stability and long-term predictive accuracy. The results are significant when models are trained on datasets of all sizes (small, medium, and large training sets) and for all prediction horizons (short, medium, and long prediction horizons.)

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes