LGAug 19, 2024Code
Machine Learning with Physics Knowledge for Prediction: A SurveyJoe Watson, Chen Song, Oliver Weeger et al. · cambridge
This survey examines the broad suite of methods and models for combining machine learning with physics knowledge for prediction and forecast, with a focus on partial differential equations. These methods have attracted significant interest due to their potential impact on advancing scientific research and industrial practices by improving predictive models with small- or large-scale datasets and expressive predictive models with useful inductive biases. The survey has two parts. The first considers incorporating physics knowledge on an architectural level through objective functions, structured predictive models, and data augmentation. The second considers data as physics knowledge, which motivates looking at multi-task, meta, and contextual learning as an alternative approach to incorporating physics knowledge in a data-driven fashion. Finally, we also provide an industrial perspective on the application of these methods and a survey of the open-source ecosystem for physics-informed machine learning.
59.9CEMay 29
On limitations of polyconvexityDominik K. Klein, Rogelio Ortigosa, Heinrich T. Roth et al.
Polyconvex constitutive modeling is attractive as it guarantees stability of numerical simulations and can improve the generalization behavior of material models. However, in certain applications, polyconvex formulations perform poorly in reproducing the underlying ground truth material response, which can effectively preclude their practical use. In this work, we address this issue and investigate the limitations of polyconvex constitutive modeling. The main contributions of this paper are as follows: (1) We analyze the theoretical reasons why polyconvexity may, in some cases, impose overly restrictive constraints that limit the achievable accuracy of constitutive models. Thereby, we provide analytical ellipticity guarantees for two non-polyconvex Mooney-Rivlin type potentials. (2) We investigate the practical limitations of polyconvex physics-augmented neural network constitutive models using two representative formulations: models using structural tensor-based invariants and models using signed singular values. Their performance is evaluated on datasets obtained from homogenized microstructured materials, and their predictive capabilities are assessed in finite element simulations. (3) Overall, we provide an overview of benefits, limitations, and mitigation strategies of polyconvex constitutive modeling.
48.4CEMay 26
Advances in polyconvex anisotropic hyperelasticityDominik K. Klein, Karl A. Kalina, Rogelio Ortigosa et al.
A key challenge in material theory is the formulation of models that satisfy all common mechanical constitutive conditions while retaining sufficient flexibility. In this context, several important modeling aspects remain unresolved for polyconvex anisotropic hyperelasticity. We address some of these challenges and apply our results for physics-augmented neural network (PANN) constitutive modeling. The main contributions of this paper are as follows: (1) We propose a new polyconvex PANN constitutive model for anisotropic hyperelasticity based on triclinic invariants and group symmetrization. For finite symmetry groups, this model fulfills all common mechanical constitutive conditions a priori. (2) We propose a group symmetrization-based method for the construction of polyconvex invariants for finite symmetry groups. Based on this, we derive a new integrity basis for a tetragonal symmetry group and a new functional basis for a cubic symmetry group. To the best of our knowledge, these are the first polyconvex integrity or functional bases for symmetry groups characterized by structural tensors of order higher than two. (3) We provide an extensive introduction to the construction of polyconvex integrity and functional bases, which form the basis of polyconvex invariant-based constitutive models. We discuss polyconvex bases for triclinic, isotropic, transversely isotropic, monoclinic, rhombic, tetragonal, and cubic symmetry groups. (4) We benchmark the polyconvex PANN constitutive models with highly nonlinear homogenization data of cubic metamaterials.
CEJul 4, 2024
TwinLab: a framework for data-efficient training of non-intrusive reduced-order models for digital twinsMaximilian Kannapinn, Michael Schäfer, Oliver Weeger
Purpose: Simulation-based digital twins represent an effort to provide high-accuracy real-time insights into operational physical processes. However, the computation time of many multi-physical simulation models is far from real-time. It might even exceed sensible time frames to produce sufficient data for training data-driven reduced-order models. This study presents TwinLab, a framework for data-efficient, yet accurate training of neural-ODE type reduced-order models with only two data sets. Design/methodology/approach: Correlations between test errors of reduced-order models and distinct features of corresponding training data are investigated. Having found the single best data sets for training, a second data set is sought with the help of similarity and error measures to enrich the training process effectively. Findings: Adding a suitable second training data set in the training process reduces the test error by up to 49% compared to the best base reduced-order model trained only with one data set. Such a second training data set should at least yield a good reduced-order model on its own and exhibit higher levels of dissimilarity to the base training data set regarding the respective excitation signal. Moreover, the base reduced-order model should have elevated test errors on the second data set. The relative error of the time series ranges from 0.18% to 0.49%. Prediction speed-ups of up to a factor of 36,000 are observed. Originality: The proposed computational framework facilitates the automated, data-efficient extraction of non-intrusive reduced-order models for digital twins from existing simulation models, independent of the simulation software.
46.9CEApr 14
The cross-sectional warping problem for hyperelastic beams: An efficient formulation in Voigt notationJuan C. Alzate Cobo, Tobias Henkels, Oliver Weeger
Beam theory has traditionally been restricted to small elastic strains and rigid cross-sections. Relaxing these assumptions within closed-form analytical frameworks remains challenging. In contrast, the cross-sectional warping problem provides a computational approach that enables the derivation of general, nonlinear constitutive relations for beam models, thereby overcoming both limitations. In this work, we reinterpret the cross-sectional warping problem for hyperelastic beams and propose a fully material formulation in terms of the Green-Lagrange strain and the second Piola-Kirchhoff stress tensors. Owing to the symmetry of these tensors, the formulation can be expressed efficiently in Voigt notation and is thus particularly well-suited for straightforward numerical implementation. We demonstrate the validity of this alternative formulation in numerical examples, including the computation of the effective beam stiffness, for which we derive the sensitivities of the warping displacement. To promote reproducibility, we accompany this article with an open-access repository containing the isogeometric finite element implementation and all numerical examples presented herein, enabling other researchers to readily reproduce and build upon our results.
LGFeb 4, 2025
Stable Port-Hamiltonian Neural NetworksFabian J. Roth, Dominik K. Klein, Maximilian Kannapinn et al.
In recent years, nonlinear dynamic system identification using artificial neural networks has garnered attention due to its broad potential applications across science and engineering. However, purely data-driven approaches often struggle with extrapolation and may yield physically implausible forecasts. Furthermore, the learned dynamics can exhibit instabilities, making it difficult to apply such models safely and robustly. This article introduces stable port-Hamiltonian neural networks, a machine learning architecture that incorporates physical biases of energy conservation and dissipation while ensuring global Lyapunov stability of the learned dynamics. Through illustrative and real-world examples, we demonstrate that these strong inductive biases facilitate robust learning of stable dynamics from sparse data, while avoiding instability and surpassing purely data-driven approaches in accuracy and physically meaningful generalization. Furthermore, the model's applicability and potential for data-driven surrogate modeling are showcased on multi-physics simulation data.
36.4CEApr 8
Immersed boundary-conformal isogeometric methods for magnetostaticsYusuf T. Elbadry, Giuliano Guarino, Pablo Antolín et al.
Isogeometric analysis was proposed to bridge the gap between computer-aided design and numerical discretization. However, standard multi-patch isogeometric analysis mandates conformal discretizations across patch interfaces, posing challenges for multi-material domain problems. In the context of electric machines, this requirement becomes evident in a large number of patches needed to represent machines consisting of several domains and materials. In this work, we adopt, extend, and evaluate three non-conformal discretization strategies for magnetostatic problems: a fully immersed approach, the union with non-conformal patches, and the union with conformal layers. In all three methods, boundary-conformal high-order quadrature rules are employed for integration over trimmed boundary and interface elements. In the two union approaches, material regions are, as far as possible, represented by independent non-conformal spline patches that are embedded within a background patch and coupled weakly through Nitsche's method. In the latter framework, critical interfaces are additionally surrounded by conformal layers that enable the strong imposition of boundary conditions and improved resolution of interface behavior. The proposed approaches are assessed through several magnetostatic benchmark problems and an industrial application. The numerical results show that the union methods achieve highly accurate solutions, while the fully immersed approach struggles with discontinuities in field gradients across material interfaces. Nevertheless, these methods significantly reduce the geometric preprocessing effort compared to conventional, conformal multi-patch analysis and require substantially fewer patches. Overall, this demonstrates that our immersed boundary-conformal isogeometric framework possesses great potential for efficient simulation of complex electromagnetic devices.
MTRL-SCIJun 20, 2021
Polyconvex anisotropic hyperelasticity with neural networksDominik K. Klein, Mauricio Fernández, Robert J. Martin et al.
In the present work, two machine learning based constitutive models for finite deformations are proposed. Using input convex neural networks, the models are hyperelastic, anisotropic and fulfill the polyconvexity condition, which implies ellipticity and thus ensures material stability. The first constitutive model is based on a set of polyconvex, anisotropic and objective invariants. The second approach is formulated in terms of the deformation gradient, its cofactor and determinant, uses group symmetrization to fulfill the material symmetry condition, and data augmentation to fulfill objectivity approximately. The extension of the dataset for the data augmentation approach is based on mechanical considerations and does not require additional experimental or simulation data. The models are calibrated with highly challenging simulation data of cubic lattice metamaterials, including finite deformations and lattice instabilities. A moderate amount of calibration data is used, based on deformations which are commonly applied in experimental investigations. While the invariant-based model shows drawbacks for several deformation modes, the model based on the deformation gradient alone is able to reproduce and predict the effective material behavior very well and exhibits excellent generalization capabilities. In addition, the models are calibrated with transversely isotropic data, generated with an analytical polyconvex potential. For this case, both models show excellent results, demonstrating the straightforward applicability of the polyconvex neural network constitutive models to other symmetry groups.