MSSep 15, 2017
A performance spectrum for parallel computational frameworks that solve PDEsJ. Chang, K. B. Nakshatrala, M. G. Knepley et al.
Important computational physics problems are often large-scale in nature, and it is highly desirable to have robust and high performing computational frameworks that can quickly address these problems. However, it is no trivial task to determine whether a computational framework is performing efficiently or is scalable. The aim of this paper is to present various strategies for better understanding the performance of any parallel computational frameworks for solving PDEs. Important performance issues that negatively impact time-to-solution are discussed, and we propose a performance spectrum analysis that can enhance one's understanding of critical aforementioned performance issues. As proof of concept, we examine commonly used finite element simulation packages and software and apply the performance spectrum to quickly analyze the performance and scalability across various hardware platforms, software implementations, and numerical discretizations. It is shown that the proposed performance spectrum is a versatile performance model that is not only extendable to more complex PDEs such as hydrostatic ice sheet flow equations, but also useful for understanding hardware performance in a massively parallel computing environment. Potential applications and future extensions of this work are also discussed.
NAApr 9, 2016
Large-scale Optimization-based Non-negative Computational Framework for Diffusion Equations: Parallel Implementation and Performance StudiesJ. Chang, S. Karra, K. B. Nakshatrala
It is well-known that the standard Galerkin formulation, which is often the formulation of choice under the finite element method for solving self-adjoint diffusion equations, does not meet maximum principles and the non-negative constraint for anisotropic diffusion equations. Recently, optimization-based methodologies that satisfy maximum principles and the non-negative constraint for steady-state and transient diffusion-type equations have been proposed. To date, these methodologies have been tested only on small-scale academic problems. The purpose of this paper is to systematically study the performance of the non-negative methodology in the context of high performance computing (HPC). PETSc and TAO libraries are, respectively, used for the parallel environment and optimization solvers. For large-scale problems, it is important for computational scientists to understand the computational performance of current algorithms available in these scientific libraries. The numerical experiments are conducted on the state-of-the-art HPC systems, and a single-core performance model is used to better characterize the efficiency of the solvers. Our studies indicate that the proposed non-negative computational framework for diffusion-type equations exhibits excellent strong scaling for real-world large-scale problems.
CEAug 24, 2018
Composable block solvers for the four-field double porosity/permeability modelM. S. Joshaghani, J. Chang, K. B. Nakshatrala et al.
The objective of this paper is twofold. First, we propose two composable block solver methodologies to solve the discrete systems that arise from finite element discretizations of the double porosity/permeability (DPP) model. The DPP model, which is a four-field mathematical model, describes the flow of a single-phase incompressible fluid in a porous medium with two distinct pore-networks and with a possibility of mass transfer between them. Using the composable solvers feature available in PETSc and the finite element libraries available under the Firedrake Project, we illustrate two different ways by which one can effectively precondition these large systems of equations. Second, we employ the recently developed performance model called the Time-Accuracy-Size (TAS) spectrum to demonstrate that the proposed composable block solvers are scalable in both the parallel and algorithmic sense. Moreover, we utilize this spectrum analysis to compare the performance of three different finite element discretizations (classical mixed formulation with H(div) elements, stabilized continuous Galerkin mixed formulation, and stabilized discontinuous Galerkin mixed formulation) for the DPP model. Our performance spectrum analysis demonstrates that the composable block solvers are fine choices for any of these three finite element discretizations. Sample computer codes are provided to illustrate how one can easily implement the proposed block solver methodologies through PETSc command line options.
CEDec 4, 2016
Variational inequality approach to enforce the non-negative constraint for advection-diffusion equationsJ. Chang, K. B. Nakshatrala
Predictive simulations are crucial for the success of many subsurface applications, and it is highly desirable to obtain accurate non-negative solutions for transport equations in these numerical simulations. In this paper, we propose a computational framework based on the variational inequality (VI) which can also be used to enforce important mathematical properties (e.g., maximum principles) and physical constraints (e.g., the non-negative constraint). We demonstrate that this framework is not only applicable to diffusion equations but also to non-symmetric advection-diffusion equations. An attractive feature of the proposed framework is that it works with with any weak formulation for the advection-diffusion equations, including single-field formulations, which are computationally attractive. A particular emphasis is placed on the parallel and algorithmic performance of the VI approach across large-scale and heterogeneous problems. It is also shown that QP and VI are equivalent under certain conditions. State-of-the-art QP and VI solvers available from the PETSc library are used on a variety of steady-state 2D and 3D benchmarks, and a comparative study on the scalability between the QP and VI solvers is presented. We then extend the proposed framework to transient problems by simulating the miscible displacement of fluids in a heterogeneous porous medium and illustrate the importance of enforcing maximum principles for these types of coupled problems. Our numerical experiments indicate that VIs are indeed a viable approach for enforcing the maximum principles and the non-negative constraint in a large-scale computing environment. Also provided are Firedrake project files as well as a discussion on the computer implementation to help facilitate readers in understanding the proposed framework.
14.5LGApr 13
Autonomous Diffractometry Enabled by Visual Reinforcement LearningJ. Oppliger, M. Stifter, A. Rüegg et al.
Automation underpins progress across scientific and industrial disciplines. Yet, automating tasks requiring interpretation of abstract visual information remain challenging. For example, crystal alignment strongly relies on humans with the ability to comprehend diffraction patterns. Here we introduce an autonomous system that aligns single crystals without access to crystallography and diffraction theory. Using a model-free reinforcement learning framework, an agent learns to identify and navigate towards high-symmetry orientations directly from Laue diffraction patterns. Despite the absence of human supervision, the agent develops human-like strategies to achieve time-efficient alignment across different crystal symmetry classes. With this, we provide a computational framework for intelligent diffractometers. As such, our approach advances the development of automated experimental workflows in materials science.