92.0PLASM-PHMar 25
Multi-GPU Hybrid Particle-in-Cell Monte Carlo Simulations for Exascale Computing SystemsJeremy J. Williams, Jordy Trilaksono, Stefan Costea et al.
Particle-in-Cell (PIC) Monte Carlo (MC) simulations are central to plasma physics but face increasing challenges on heterogeneous HPC systems due to excessive data movement, synchronization overheads, and inefficient utilization of multiple accelerators. In this work, we present a portable, multi-GPU hybrid MPI+OpenMP implementation of BIT1 that enables scalable execution on both Nvidia and AMD accelerators through OpenMP target tasks with explicit dependencies to overlap computation and communication across devices. Portability is achieved through persistent device-resident memory, an optimized contiguous one-dimensional data layout, and a transition from unified to pinned host memory to improve large data-transfer efficiency, together with GPU Direct Memory Access (DMA) and runtime interoperability for direct device-pointer access. Standardized and scalable I/O is provided using openPMD and ADIOS2, supporting high-performance file I/O, in-memory data streaming, and in-situ analysis and visualization. Performance results on pre-exascale and exascale systems, including Frontier (OLCF-5) for up to 16,000 GPUs, demonstrate significant improvements in run time, scalability, and resource utilization for large-scale PIC MC simulations.
85.5SIMay 18
Trajectory-Integrated Accessibility Analysis of Public Electric Vehicle Charging StationsYi Ju, Jiaman Wu, Zhihan Su et al.
Electric vehicle (EV) charging infrastructure is crucial for advancing EV adoption, managing charging loads, and ensuring equitable transportation electrification. However, there remains a notable gap in comprehensive accessibility metrics that integrate the mobility of the users. This study introduces a novel accessibility metric, termed Trajectory-Integrated Public EVCS Accessibility (TI-acs), and uses it to assess public electric vehicle charging station (EVCS) accessibility for approximately 6 million residents in the San Francisco Bay Area based on detailed individual trajectory data in one week. Unlike conventional home-based metrics, TI-acs incorporates the accessibility of EVCS along individuals' travel trajectories, bringing insights on more public charging contexts, including public charging near workplaces and charging during grid off-peak periods. As of June 2024, given the current public EVCS network, Bay Area residents have, on average, 7.5 hours and 5.2 hours of access per day during which their stay locations are within 1 km (i.e. 10-12 min walking) of a public L2 and DCFC charging port, respectively. Over the past decade, TI-acs has steadily increased from the rapid expansion of the EV market and charging infrastructure. However, spatial disparities remain significant, as reflected in Gini indices of 0.38 (L2) and 0.44 (DCFC) across census tracts. Additionally, our analysis reveals racial disparities in TI-acs, driven not only by variations in charging infrastructure near residential areas but also by differences in their mobility patterns.
40.8ETMay 8
Post-Moore Technologies for Plasma Simulation: A Community RoadmapLuca Pennati, Erik M. Åsgrim, Jeremy J. Williams et al.
Plasma simulations are among the most computationally demanding scientific workloads, combining high-dimensional kinetic evolution, particle-mesh coupling, field solves, and data-intensive communication. As general-purpose processor scaling slows, post-Moore technologies are being explored to address bottlenecks in data movement, memory access, and power consumption. This paper provides a community perspective on the role of these technologies in plasma simulation, assessing three major classes: reconfigurable and data-path accelerators, non-von Neumann architectures, and quantum computing. Each is evaluated, in a co-design approach, against representative plasma workloads spanning particle-in-cell, continuum Vlasov, gyrokinetic, fluid/MHD, hybrid, and warm dense matter methods. We find that no single technology can replace existing HPC platforms. Instead, three tiers of opportunity emerge: FPGA-class and data-path accelerators offer near-term kernel offload and workflow-level data services, non-von Neumann architectures represent medium-term directions for operator-level acceleration, and quantum computing, although the least mature, is potentially the most disruptive for warm dense matter and inertial confinement fusion microphysics. We outline best practices for selective adoption and identify focused demonstrators, benchmarking, and modular software ecosystems as immediate community priorities.
1.1SYApr 13
Scalable Optimization for Mobility-Aware Coordinated Electric Vehicle Charging in Distribution Power NetworksYi Ju, Lunlong Li, Jingchun Wang et al.
Rapid growth in electric-vehicle (EV) charging demand is placing increasing stress on distribution power networks (DPNs), whose hosting capacity is often limited and spatially uneven. Beyond demonstrating that coordination can help, this paper answers an open question that is central for planners: what is the maximal achievable benefit of EV demand flexibility in reducing overload-driven distribution upgrades at a regional scale? Establishing such an upper bound is computationally challenging, as it entails solving and certifying near-optimal solutions to population-scale optimization problems with millions of variables and both spatial and temporal coupling. We introduce MAC (Mobility-Aware Coordinated EV charging), a framework that quantifies the maximum potential of leveraging EV demand flexibility to mitigate DPN overloading risk without interrupting drivers' travel needs. (i) MAC expands feasible scheduling by coupling charging decisions over a full mobility horizon: instead of enforcing per-session energy recovery, it only requires the EV state-of-charge (SOC) to remain sufficient for upcoming trips. (ii) MAC is computationally scalable via an ADMM-based decomposition with custom subproblem solvers, and admits a decentralized interpretation in which dual variables act as locational-temporal price signals that implement the social optimum as a competitive equilibrium. Using high-resolution mobility trajectories and feeder hosting-capacity data in a future-oriented 30% EV adoption scenario for the San Francisco Bay Area, we show that MAC can dramatically reduce overload-driven upgrade requirements relative to unmanaged charging. This paper illustrates how trajectory-coupled flexibility and scalable, certifiable optimization can provide actionable best-case benchmarks for DPN planning and operations.