8.7NAMay 18
DG = FEM + flat elements, Part I: DiffusionJiří Szotkowski, Václav Kučera, Chi-Wang Shu et al.
We establish a simple, rigorous, and easy to implement connection between the classical continuous finite element method (FEM) and the discontinuous Galerkin (DG) method for Poisson's problem. The key idea is to insert a vanishing-thickness layer of "dummy" elements along cell interfaces. By modifying the diffusion coefficient on these elements to be proportional to their thickness, we prove the FEM formulation converges to Babuška-Zlámal DG with trapezoidal edge quadrature. The scheme is trivial to implement by (i) a mesh edit that introduces degenerate interface elements and (ii) a single Jacobian threshold in an otherwise unmodified FEM code to handle the degenerate elements via the tempered finite element (TFEM) framework. We provide a rigorous derivation of the resulting TFEM-DG scheme, prove optimal $H^1$ and $L^2$ error estimates, and present numerical experiments in 2D and 3D. The method allows for simple implementation of DG in a FEM code and even adaptive element-by-element switching between FEM and DG with minimal coding effort. The framework is readily extensible, as we will demonstrate in a companion paper dedicated to evolutionary nonlinear first-order hyperbolic systems.
6.5DCMay 15
An efficient multi-GPU implementation for the Discontinuous Galerkin ocean model SLIMMiguel De Le Court, Vincent Legat, Ange P. Ishimwe et al.
Unstructured-mesh ocean models are increasingly used for coastal applications due to their ability to represent complex geometries and apply local grid refinement where needed. However, their broader use has been hindered by their high computational cost, particularly for models based on the Discontinuous Galerkin finite element (DG-FE) method, which involves significantly more degrees of freedom than traditional finite volume or continuous finite element approaches. The rapid emergence of GPU-based high-performance computing architectures now offers a pathway to address this limitation, as DG-FE formulations are inherently well suited to massively parallel, element-wise computations. Here, we present a full 3D DG-FE ocean model implementation optimized for both single- and multi-GPU systems, with support for both NVIDIA and AMD architectures. We detail the computational strategies employed to achieve high performance, including memory layout optimization, kernel-level parallelization, and matrix-free solvers for key vertical processes. Benchmark results demonstrate that a single HPC-grade GPU (e.g. NVIDIA A100) delivers performance equivalent to approximately 1500 CPU cores, while replacing a 128-core CPU node with a 4xA100 GPU node yields a speedup of around 50x. Weak-scaling efficiency is maintained up to 1024 GPUs. We further demonstrate the model's capabilities on a real-world application in the Great Barrier Reef, achieving a spatial resolution five times finer than the most accurate existing model while maintaining a physical-to-numerical time ratio of 100. These results highlight how GPU-accelerated DG-FE methods can dramatically advance the capabilities of unstructured-mesh ocean modeling, enabling ultra-high-resolution coastal simulations that were previously infeasible.
COMP-PHJul 14, 2015
Optimizing the geometrical accuracy of curvilinear meshesThomas Toulorge, Jonathan Lambrechts, Jean-François Remacle
This paper presents a method to generate valid high order meshes with optimized geometrical accuracy. The high order meshing procedure starts with a linear mesh, that is subsequently curved without taking care of the validity of the high order elements. An optimization procedure is then used to both untangle invalid elements and optimize the geometrical accuracy of the mesh. Standard measures of the distance between curves are considered to evaluate the geometrical accuracy in planar two-dimensional meshes, but they prove computationally too costly for optimization purposes. A fast estimate of the geometrical accuracy, based on Taylor expansions of the curves, is introduced. An unconstrained optimization procedure based on this estimate is shown to yield significant improvements in the geometrical accuracy of high order meshes, as measured by the standard Haudorff distance between the geometrical model and the mesh. Several examples illustrate the beneficial impact of this method on CFD solutions, with a particular role of the enhanced mesh boundary smoothness.