Christoph Ortner

ML
32papers
1,464citations
Novelty61%
AI Score32

32 Papers

MLJun 15, 2022
MACE: Higher Order Equivariant Message Passing Neural Networks for Fast and Accurate Force Fields

Ilyes Batatia, Dávid Péter Kovács, Gregor N. C. Simm et al.

Creating fast and accurate force fields is a long-standing challenge in computational chemistry and materials science. Recently, several equivariant message passing neural networks (MPNNs) have been shown to outperform models built using other approaches in terms of accuracy. However, most MPNNs suffer from high computational cost and poor scalability. We propose that these limitations arise because MPNNs only pass two-body messages leading to a direct relationship between the number of layers and the expressivity of the network. In this work, we introduce MACE, a new equivariant MPNN model that uses higher body order messages. In particular, we show that using four-body messages reduces the required number of message passing iterations to just two, resulting in a fast and highly parallelizable model, reaching or exceeding state-of-the-art accuracy on the rMD17, 3BPA, and AcAc benchmark tasks. We also demonstrate that using higher order messages leads to an improved steepness of the learning curves.

MLMay 13, 2022
The Design Space of E(3)-Equivariant Atom-Centered Interatomic Potentials

Ilyes Batatia, Simon Batzner, Dávid Péter Kovács et al.

The rapid progress of machine learning interatomic potentials over the past couple of years produced a number of new architectures. Particularly notable among these are the Atomic Cluster Expansion (ACE), which unified many of the earlier ideas around atom density-based descriptors, and Neural Equivariant Interatomic Potentials (NequIP), a message passing neural network with equivariant features that showed state of the art accuracy. In this work, we construct a mathematical framework that unifies these models: ACE is generalised so that it can be recast as one layer of a multi-layer architecture. From another point of view, the linearised version of NequIP is understood as a particular sparsification of a much larger polynomial model. Our framework also provides a practical tool for systematically probing different choices in the unified design space. We demonstrate this by an ablation study of NequIP via a set of experiments looking at in- and out-of-domain accuracy and smooth extrapolation very far from the training data, and shed some light on which design choices are critical for achieving high accuracy. Finally, we present BOTNet (Body-Ordered-Tensor-Network), a much-simplified version of NequIP, which has an interpretable architecture and maintains accuracy on benchmark datasets.

CHEM-PHOct 2, 2022
Tensor-reduced atomic density representations

James P. Darby, Dávid P. Kovács, Ilyes Batatia et al.

Density based representations of atomic environments that are invariant under Euclidean symmetries have become a widely used tool in the machine learning of interatomic potentials, broader data-driven atomistic modelling and the visualisation and analysis of materials datasets.The standard mechanism used to incorporate chemical element information is to create separate densities for each element and form tensor products between them. This leads to a steep scaling in the size of the representation as the number of elements increases. Graph neural networks, which do not explicitly use density representations, escape this scaling by mapping the chemical element information into a fixed dimensional space in a learnable way. We recast this approach as tensor factorisation by exploiting the tensor structure of standard neighbour density based descriptors. In doing so, we form compact tensor-reduced representations whose size does not depend on the number of chemical elements, but remain systematically convergeable and are therefore applicable to a wide range of data analysis and regression tasks.

NANov 26, 2008
An Analysis of Node-Based Cluster Summation Rules in the Quasicontinuum Method

Mitchell Luskin, Christoph Ortner

We investigate two examples of node-based cluster summation rules that have been proposed for the quasicontinuum method: a force-based approach (Knap & Ortiz, J. Mech. Phys. Solids 49, 2001), and an energy-based approach which is a generalization of the non-local quasicontinuum method (Eidel & Stukowski, J. Mech. Phys. Solids, to appear). We show that, even for the case of nearest neighbour interaction in a one-dimensional periodic chain, both of these approaches create large errors when used with graded and, more generally, non-smooth meshes. These errors cannot be removed by increasing the cluster size. We offer some suggestions how the accuracy of (cluster) summation rules may be improved.

COMP-PHJun 23, 2023
Retrieval of Boost Invariant Symbolic Observables via Feature Importance

Jose M Munoz, Ilyes Batatia, Christoph Ortner et al.

Deep learning approaches for jet tagging in high-energy physics are characterized as black boxes that process a large amount of information from which it is difficult to extract key distinctive observables. In this proceeding, we present an alternative to deep learning approaches, Boost Invariant Polynomials, which enables direct analysis of simple analytic expressions representing the most important features in a given task. Further, we show how this approach provides an extremely low dimensional classifier with a minimum set of features representing %effective discriminating physically relevant observables and how it consequently speeds up the algorithm execution, with relatively close performance to the algorithm using the full information.

NAMar 15, 2011
The role of the patch test in 2D atomistic-to-continuum coupling methods

Christoph Ortner

For a general class of atomistic-to-continuum coupling methods, coupling multi-body interatomic potentials with a P1-finite element discretisation of Cauchy--Born nonlinear elasticity, this paper adresses the question whether patch test consistency (or, absence of ghost forces) implies a first-order error estimate. In two dimensions it is shown that this is indeed true under the following additional technical assumptions: (i) an energy consistency condition, (ii) locality of the interface correction, (iii) volumetric scaling of the interface correction, and (iv) connectedness of the atomistic region. The extent to which these assumptions are necessary is discussed in detail.

NASep 2, 2013
Theory-based Benchmarking of the Blended Force-Based Quasicontinuum Method

Xingjie Helen Li, Mitchell Luskin, Christoph Ortner et al.

We formulate an atomistic-to-continuum coupling method based on blending atomistic and continuum forces. Our precise choice of blending mechanism is informed by theoretical predictions. We present a range of numerical experiments studying the accuracy of the scheme, focusing in particular on its stability. These experiments confirm and extend the theoretical predictions, and demonstrate a superior accuracy of B-QCF over energy-based blending schemes.

NAOct 2, 2011
Construction and sharp consistency estimates for atomistic/continuum coupling methods with general interfaces: a 2D model problem

Christoph Ortner, Lei Zhang

We present a new variant of the geometry reconstruction approach for the formulation of atomistic/continuum coupling methods (a/c methods). For multi-body nearest-neighbour interactions on the 2D triangular lattice, we show that patch test consistent a/c methods can be constructed for arbitrary interface geometries. Moreover, we prove that all methods within this class are first-order consistent at the atomistic/continuum interface and second-order consistent in the interior of the continuum region.

NADec 12, 2011
Positive-Definiteness of the Blended Force-Based Quasicontinuum Method

Xingjie Helen Li, Mitchell Luskin, Christoph Ortner

The development of consistent and stable quasicontinuum models for multi-dimensional crystalline solids remains a challenge. For example, proving stability of the force-based quasicontinuum (QCF) model remains an open problem. In 1D and 2D, we show that by blending atomistic and Cauchy--Born continuum forces (instead of a sharp transition as in the QCF method) one obtains positive-definite blended force-based quasicontinuum (B-QCF) models. We establish sharp conditions on the required blending width.

NAMay 23, 2011
A Computational and Theoretical Investigation of the Accuracy of Quasicontinuum Methods

Brian Van Koten, Xingjie Helen Li, Mitchell Luskin et al.

We give computational results to study the accuracy of several quasicontinuum methods for two benchmark problems - the stability of a Lomer dislocation pair under shear and the stability of a lattice to plastic slip under tensile loading. We find that our theoretical analysis of the accuracy near instabilities for one-dimensional model problems can successfully explain most of the computational results for these multi-dimensional benchmark problems. However, we also observe some clear discrepancies, which suggest the need for additional theoretical analysis and benchmark problems to more thoroughly understand the accuracy of quasicontinuum methods.

NAApr 20, 2010
The Spectrum of the Force-Based Quasicontinuum Operator for a Homogeneous Periodic Chain

Matthew Dobson, Christoph Ortner, Alexander V. Shapeev

We show under general conditions that the linearized force-based quasicontinuum (QCF) operator has a positive spectrum, which is identical to the spectrum of the quasinonlocal quasicontinuum (QNL) operator in the case of second-neighbour interactions. Moreover, we establish a bound on the condition number of a matrix of eigenvectors that is uniform in the number of atoms and the size of the atomistic region. These results establish the validity of and improve upon recent conjectures ([arXiv:0907.3861, Conjecture 2] and [arXiv:0910.2013, Conjecture 8]) which were based on numerical experiments. As immediate consequences of our results we obtain rigorous estimates for convergence rates of (preconditioned) GMRES algorithms, as well as a new stability estimate for the QCF method.

NANov 14, 2012
A posteriori error control for a quasicontinuum approximation of a periodic chain

Christoph Ortner, Hao Wang

We consider a 1D periodic atomistic model, for which we formulate and analyze an adaptive variant of a quasicontinuum method. We establish a posteriori error estimates for the energy norm and for the energy, based on a posteriori residual and stability estimates. We formulate adaptive mesh refinement algorithms based on these error estimators. Our numerical experiments indicate optimal convergence rates of these algorithms.

COMP-PHJul 17, 2022
BIP: Boost Invariant Polynomials for Efficient Jet Tagging

Jose M Munoz, Ilyes Batatia, Christoph Ortner

Deep Learning approaches are becoming the go-to methods for data analysis in High Energy Physics (HEP). Nonetheless, most physics-inspired modern architectures are computationally inefficient and lack interpretability. This is especially the case with jet tagging algorithms, where computational efficiency is crucial considering the large amounts of data produced by modern particle detectors. In this work, we present a novel, versatile and transparent framework for jet representation; invariant to Lorentz group boosts, which achieves high accuracy on jet tagging benchmarks while being orders of magnitudes faster to train and evaluate than other modern approaches for both supervised and unsupervised schemes.

NAApr 2, 2011
Analysis of an Energy-based Atomistic/Continuum Coupling Approximation of a Vacancy in the 2D Triangular Lattice

Christoph Ortner, Alexander V. Shapeev

We present a comprehensive a priori error analysis of a practical energy based atomistic/continuum coupling method (Shapeev, arXiv:1010.0512) in two dimensions, for finite-range pair-potential interactions, in the presence of vacancy defects. The majority of the work is devoted to the analysis of consistency and stability of the method. These yield a priori error estimates in the H1-norm and the energy, which depend on the mesh size and the "smoothness" of the atomistic solution in the continuum region. Based on these error estimates, we present heuristics for an optimal choice of the atomistic region and the finite element mesh, which yields convergence rates in terms of the number of degrees of freedom. The analytical predictions are supported by extensive numerical tests.

NANov 29, 2017
Force-Based Atomistic/Continuum Blending for Multilattices

Derek Olson, Xingjie Li, Christoph Ortner et al.

We formulate the blended force-based quasicontinuum (BQCF) method for multilattices and develop rigorous error estimates in terms of the approximation parameters: atomistic region, blending region and continuum finite element mesh. Balancing the approximation parameters yields a convergent atomistic/continuum multiscale method for multilattices with point defects, including a rigorous convergence rate in terms of the computational cost. The analysis is illustrated with numerical results for a Stone--Wales defect in graphene.

NAMar 27, 2012
Symmetries of 2-lattices and second order accuracy of the Cauchy--Born Model

Brian Van Koten, Christoph Ortner

We show that the Cauchy--Born model of a single-species 2-lattice is second order if the atomistic and continuum kinematics are connected in a novel way. Our proof uses a generalization to 2-lattices of the point symmetry of Bravais lattices. Moreover, by identifying similar symmetries in multi-species pair interaction models, we construct a new stored energy density, using shift-gradients but not strain gradients, that is also second order accurate. These results can be used to develop highly accurate continuum models and atomistic/continuum coupling methods for materials such as graphene, hcp metals, and shape memory alloys.

NAApr 10, 2011
Linear Stationary Iterative Methods for the Force-based Quasicontinuum Approximation

Mitchell Luskin, Christoph Ortner

Force-based multiphysics coupling methods have become popular since they provide a simple and efficient coupling mechanism, avoiding the difficulties in formulating and implementing a consistent coupling energy. They are also the only known pointwise consistent methods for coupling a general atomistic model to a finite element continuum model. However, the development of efficient and reliable iterative solution methods for the force-based approximation presents a challenge due to the non-symmetric and indefinite structure of the linearized force-based quasicontinuum approximation, as well as to its unusual stability properties. In this paper, we present rigorous numerical analysis and computational experiments to systematically study the stability and convergence rate for a variety of linear stationary iterative methods.

NAMar 9, 2016
On assessing the accuracy of defect free energy computations

Matthew Dobson, Manh Hong Duong, Christoph Ortner

We develop a rigorous error analysis for coarse-graining of defect-formation free energy. For a one-dimensional constrained atomistic system, we establish the thermodynamic limit of the defect-formation free energy and obtain explicitly the rate of convergence. We then construct a sequence of coarse-grained energies with the same rate but significantly reduced computational cost. We illustrate our analytical results through explicit computations for the case of harmonic potentials and through numerical simulations.

NAMay 18, 2016
Analysis of a Predictor-Corrector Method for Computationally Efficient Modeling of Surface Effects in 1D

Andrew J. Binder, Mitchell Luskin, Christoph Ortner

The regular Cauchy--Born method is a useful and efficient tool for analyzing bulk properties of materials in the absence of defects. However, the method normally fails to capture surface effects, which are essential to determining material properties at small length scales. In this paper, we present a corrector method that improves upon the prediction for material behavior from the Cauchy--Born method over a small boundary layer at the surface of a 1D material by capturing the missed surface effects. We justify the separation of the problem into a bulk response and a localized surface correction by establishing an error estimate, which vanishes in the long wavelength limit.

MLOct 16, 2023
Equivariant Matrix Function Neural Networks

Ilyes Batatia, Lars L. Schaaf, Huajie Chen et al.

Graph Neural Networks (GNNs), especially message-passing neural networks (MPNNs), have emerged as powerful architectures for learning on graphs in diverse applications. However, MPNNs face challenges when modeling non-local interactions in graphs such as large conjugated molecules, and social networks due to oversmoothing and oversquashing. Although Spectral GNNs and traditional neural networks such as recurrent neural networks and transformers mitigate these challenges, they often lack generalizability, or fail to capture detailed structural relationships or symmetries in the data. To address these concerns, we introduce Matrix Function Neural Networks (MFNs), a novel architecture that parameterizes non-local interactions through analytic matrix equivariant functions. Employing resolvent expansions offers a straightforward implementation and the potential for linear scaling with system size. The MFN architecture achieves stateof-the-art performance in standard graph benchmarks, such as the ZINC and TU datasets, and is able to capture intricate non-local interactions in quantum systems, paving the way to new state-of-the-art force fields.

MLMay 31, 2023
A General Framework for Equivariant Neural Networks on Reductive Lie Groups

Ilyes Batatia, Mario Geiger, Jose Munoz et al.

Reductive Lie Groups, such as the orthogonal groups, the Lorentz group, or the unitary groups, play essential roles across scientific fields as diverse as high energy physics, quantum mechanics, quantum chromodynamics, molecular dynamics, computer vision, and imaging. In this paper, we present a general Equivariant Neural Network architecture capable of respecting the symmetries of the finite-dimensional representations of any reductive Lie Group G. Our approach generalizes the successful ACE and MACE architectures for atomistic point clouds to any data equivariant to a reductive Lie group action. We also introduce the lie-nn software library, which provides all the necessary tools to develop and implement such general G-equivariant neural networks. It implements routines for the reduction of generic tensor products of representations into irreducible representations, making it easy to apply our architecture to a wide range of problems and groups. The generality and performance of our approach are demonstrated by applying it to the tasks of top quark decay tagging (Lorentz group) and shape recognition (orthogonal group).

NAOct 14, 2018
Geometry Equilibration of Crystalline Defects in Quantum and Atomistic Descriptions

Huajie Chen, Faizan Q. Nazar, Christoph Ortner

We develop a rigorous framework for modelling the geometry equilibration of crystalline defects. We formulate the equilibration of crystal defects as a variational problems on a discrete energy space and establish qualitatively sharp far-field decay estimates for the equilibrium configuration. This work extends Ehrlacher, Ortner, Shapeev (2016) by admitting infinite-range interaction which in particular includes some quantum chemistry based interatomic potentials.

MATH-PHSep 11, 2017
Thermodynamic Limit of Crystal Defects with Finite Temperature Tight Binding

Huajie Chen, Jianfeng Lu, Christoph Ortner

We consider a tight binding model for localised crystalline defects with electrons in the canonical ensemble (finite electronic temperature) and nuclei positions relaxed according to the Born--Oppenheimer approximation. We prove that the limit model as the computational domain size grows to infinity is formulated in the grand-canonical ensemble for the electrons. The Fermi-level for the limit model is fixed at a homogeneous crystal level, independent of the defect or electron number in the sequence of finite-domain approximations. We quantify the rates of convergence for the nuclei configuration and for the Fermi-level.

MATH-PHAug 5, 2016
Electronic Density of States for Incommensurate Layers

Daniel Massatt, Mitchell Luskin, Christoph Ortner

We prove that the electronic density of states (DOS) for 2D incommensurate layered structures, where Bloch theory does not apply, is well-defined as the thermodynamic limit of finite clusters. In addition, we obtain an explicit representation formula for the DOS as an integral over local configurations. Next, based on this representation formula, we propose a novel algorithm for computing electronic structure properties in incommensurate heterostructures, which overcomes limitations of the common approach to artificially strain a large supercell and then apply Bloch theory.

NAJul 29, 2016
Convergence and Cycling in Walker-type Saddle Search Algorithms

Antoine Levitt, Christoph Ortner

Algorithms for computing local minima of smooth objective functions enjoy a mature theory as well as robust and efficient implementations. By comparison, the theory and practice of saddle search is destitute. In this paper we present results for idealized versions of the dimer and gentlest ascent (GAD) saddle search algorithms that show-case the limitations of what is theoretically achievable within the current class of saddle search algorithms: (1) we present an improved estimate on the region of attraction of saddles; and (2) we construct quasi-periodic solutions which indicate that it is impossible to obtain globally convergent variants of dimer and GAD type algorithms.

NASep 22, 2015
QM/MM methods for crystalline defects. Part 2: Consistent energy and force-mixing

Huajie Chen, Christoph Ortner

QM/MM hybrid methods employ accurate quantum (QM) models only in regions of interest (defects) and switch to computationally cheaper interatomic potential (MM) models to describe the crystalline bulk. We develop two QM/MM hybrid methods for crystalline defect simulations, an energy-based and a force-based formulation, employing a tight binding QM model. Both methods build on two principles: (i) locality of the QM model; and (ii) constructing the MM model as an explicit and controllable approximation of the QM model. This approach enables us to establish explicit convergence rates in terms of the size of QM region.

NAJun 18, 2015
QM/MM methods for crystalline defects. Part 1: Locality of the tight binding model

Huajie Chen, Christoph Ortner

The tight binding model is a minimal electronic structure model for molecular modelling and simulation. We show that the total energy in this model can be decomposed into site energies, that is, into contributions from each atomic site whose influence on their environment decays exponentially. This result lays the foundation for a rigorous analysis of QM/MM coupling schemes.

NAJul 2, 2010
Accuracy of Quasicontinuum Approximations Near Instabilities

Matthew Dobson, Mitchell Luskin, Christoph Ortner

The formation and motion of lattice defects such as cracks, dislocations, or grain boundaries, occurs when the lattice configuration loses stability, that is, when an eigenvalue of the Hessian of the lattice energy functional becomes negative. When the atomistic energy is approximated by a hybrid energy that couples atomistic and continuum models, the accuracy of the approximation can only be guaranteed near deformations where both the atomistic energy as well as the hybrid energy are stable. We propose, therefore, that it is essential for the evaluation of the predictive capability of atomistic-to-continuum coupling methods near instabilities that a theoretical analysis be performed, at least for some representative model problems, that determines whether the hybrid energies remain stable {\em up to the onset of instability of the atomistic energy}. We formulate a one-dimensional model problem with nearest and next-nearest neighbor interactions and use rigorous analysis, asymptotic methods, and numerical experiments to obtain such sharp stability estimates for the basic conservative quasicontinuum (QC) approximations. Our results show that the consistent quasi-nonlocal QC approximation correctly reproduces the stability of the atomistic system, whereas the inconsistent energy-based QC approximation incorrectly predicts instability at a significantly reduced applied load that we describe by an analytic criterion in terms of the derivatives of the atomistic potential.

NAJul 2, 2010
Iterative Methods for the Force-based Quasicontinuum Approximation

Matthew Dobson, Mitchell Luskin, Christoph Ortner

Force-based atomistic-continuum hybrid methods are the only known pointwise consistent methods for coupling a general atomistic model to a finite element continuum model. For this reason, and due to their algorithmic simplicity, force-based coupling methods have become a popular class of atomistic-continuum hybrid models as well as other types of multiphysics models. However, the recently discovered unusual stability properties of the linearized force-based quasicontinuum (QCF) approximation, especially its indefiniteness, present a challenge to the development of efficient and reliable iterative methods. We present analytic and computational results for the generalized minimal residual (GMRES) solution of the linearized QCF equilibrium equations. We show that the GMRES method accurately reproduces the stability of the force-based approximation and conclude that an appropriately preconditioned GMRES method results in a reliable and efficient solution method.

NAJul 22, 2009
Sharp Stability Estimates for the Force-based Quasicontinuum Method

Matthew Dobson, Mitchell Luskin, Christoph Ortner

A sharp stability analysis of atomistic-to-continuum coupling methods is essential for evaluating their capabilities for predicting the formation and motion of lattice defects. We formulate a simple one-dimensional model problem and give a detailed analysis of the stability of the force-based quasicontinuum (QCF) method. The focus of the analysis is the question whether the QCF method is able to predict a critical load at which fracture occurs. Numerical experiments show that the spectrum of a linearized QCF operator is identical to the spectrum of a linearized energy-based quasi-nonlocal quasicontinuum operator (QNL), which we know from our previous analyses to be positive below the critical load. However, the QCF operator is non-normal and it turns out that it is not generally positive definite, even when all of its eigenvalues are positive. Using a combination of rigorous analysis and numerical experiments, we investigate in detail for which choices of "function spaces" the QCF operator is stable, uniformly in the size of the atomistic system. Force-based multi-physics coupling methods are popular techniques to circumvent the difficulties faced in formulating consistent energy-based coupling pproaches. Even though the QCF method is possibly the simplest coupling method of this kind, we anticipate that many of our observations apply more generally.

NAMar 8, 2009
Stability, Instability, and Error of the Force-based Quasicontinuum Approximation

Matthew Dobson, Mitchell Luskin, Christoph Ortner

Due to their algorithmic simplicity and high accuracy, force-based model coupling techniques are an exciting development in computational physics. For example, the force-based quasicontinuum approximation is the only known pointwise consistent quasicontinuum approximation for coupling a general atomistic model with a finite element continuum model. In this paper, we analyze the stability of the force-based quasicontinuum approximation. We then use our stability result to obtain an optimal order error analysis of this coupling method that provides theoretical justification for the high accuracy of the force-based quasicontinuum approximation -- the computational efficiency of continuum modeling can be utilized without the loss of significant accuracy if defects are captured in the atomistic region. The main challenge we need to overcome is the fact (which we prove) that the linearized quasicontinuum operator is typically not positive definite. Moreover, we prove that no uniform inf-sup stability condition holds for discrete versions of the $W^{1,p}$-$W^{1,q}$ "duality pairing" with $1/p+1/q=1$, if $1 \leq p < \infty$. We must therefore derive an inf-sup stability condition for a discrete version of the $W^{1,\infty}$-$W^{1,1}$ "duality pairing" which then leads to optimal order error estimates in a discrete $W^{1,\infty}$-norm.