Transfer Learning Across Fast- and Full-Simulation Domains in High-Energy Physics

arXiv:2605.0747145.7

AI Analysis

For high-energy physics researchers, this demonstrates that fast simulation can produce reusable representations, reducing computational costs for training models on expensive fully simulated data.

This work systematically studies transfer learning from fast-simulated to fully simulated datasets in high-energy physics, showing that pretrained models consistently outperform baselines and reduce required target-domain training data by about a factor of two across three tasks.

Machine-learning models in high-energy physics are often trained on simulated data, where fully simulated samples are computationally expensive while fast simulation provides large statistics at reduced realism. In this work, we systematically study transfer learning between fast-simulated and fully simulated datasets in a realistic LHC environment. We consider three representative tasks, signal-background classification, quark-gluon jet tagging, and missing transverse energy reconstruction, using dense neural networks, graph neural networks, and transformer-based architectures. Models are pretrained on ATLAS-like fast simulation and adapted to CMS-like fast simulation and to fully simulated ATLAS Open Data. Across all tasks, pretrained models consistently outperform independently trained baselines and require significantly less target-domain training data, typically reducing the needed statistics by about a factor of two. These results demonstrate that fast simulation can be used to learn robust, reusable representations and motivate publishing trained models as reusable scientific assets beyond large foundation models.

View on arXiv PDF

Similar