PLASM-PH LGJul 21, 2025

Efficient dataset construction using active learning and uncertainty-aware neural networks for plasma turbulent transport surrogate models

Aaron Ho, Lorenzo Zanisi, Bram de Leeuw, Vincent Galvan, Pablo Rodriguez-Fernandez, Nathaniel T. Howard

MIT

arXiv:2507.15976v12.32 citationsh-index: 18Physics of Plasmas

Originality Incremental advance

AI Analysis

This addresses the need for efficient surrogate models in plasma physics, particularly for tokamak fusion plasmas, by providing a method to reduce training data requirements, though it is incremental as it builds on a previous proof-of-principle.

This work tackled the problem of constructing efficient datasets for data-driven surrogate models in plasma turbulent transport by using uncertainty-aware neural networks and active learning with a physics simulation code as a labeller. The result was models achieving an F1 classification performance of ~0.8 and an R2 regression performance of ~0.75 on an independent test set, scaling from 10^2 to 10^4 data points over 45 iterations.

This work demonstrates a proof-of-principle for using uncertainty-aware architectures, in combination with active learning techniques and an in-the-loop physics simulation code as a data labeller, to construct efficient datasets for data-driven surrogate model generation. Building off of a previous proof-of-principle successfully demonstrating training set reduction on static pre-labelled datasets, using the ADEPT framework, this strategy was applied again to the plasma turbulent transport problem within tokamak fusion plasmas, specifically the QuaLiKiz quasilinear electrostatic gyrokinetic turbulent transport code. While QuaLiKiz provides relatively fast evaluations, this study specifically targeted small datasets to serve as a proxy for more expensive codes, such as CGYRO or GENE. The newly implemented algorithm uses the SNGP architecture for the classification component of the problem and the BNN-NCP architecture for the regression component, training models for all turbulent modes (ITG, TEM, ETG) and all transport fluxes ($Q_e$, $Q_i$, $Γ_e$, $Γ_i$, and $Π_i$) described by the general QuaLiKiz output. With 45 active learning iterations, moving from a small initial training set of $10^{2}$ to a final set of $10^{4}$, the resulting models reached a $F_1$ classification performance of ~0.8 and a $R^2$ regression performance of ~0.75 on an independent test set across all outputs. This extrapolates to reaching the same performance and efficiency as the previous ADEPT pipeline, although on a problem with 1 extra input dimension. While the improvement rate achieved in this implementation diminishes faster than expected, the overall technique is formulated with components that can be upgraded and generalized to many surrogate modeling applications beyond plasma turbulent transport predictions.

View on arXiv PDF

Similar