Towards Foundation Models for Experimental Readout Systems Combining Discrete and Continuous Data
This work tackles experimental readout challenges in nuclear physics, specifically for the Electron Ion Collider, representing an incremental advancement in applying foundation models to domain-specific detector systems.
The authors developed a foundation model for nuclear physics that processes low-level detector inputs from Imaging Cherenkov Detectors, addressing challenges like resolution loss and limited conditional generation. The model achieved fast, high-fidelity generation of pixel and time sequences for Cherenkov photons, validated through closure tests, and demonstrated generalization to tasks like pion/kaon identification and noise filtering.
We present a (proto) Foundation Model for Nuclear Physics, capable of operating on low-level detector inputs from Imaging Cherenkov Detectors at the future Electron Ion Collider. Building upon established next-token prediction approaches, we aim to address potential challenges such as resolution loss from existing tokenization schemes and limited support for conditional generation. We propose four key innovations: (i) separate vocabularies for discrete and continuous variates, combined via Causal Multi-Head Cross-Attention (CMHCA), (ii) continuous kinematic conditioning through prepended context embeddings, (iii) scalable and simple, high-resolution continuous variate tokenization without joint vocabulary inflation, and (iv) class conditional generation through a Mixture of Experts. Our model enables fast, high-fidelity generation of pixel and time sequences for Cherenkov photons, validated through closure tests in the High Performance DIRC. We also show our model generalizes to reconstruction tasks such as pion/kaon identification, and noise filtering, in which we show its ability to leverage fine-tuning under specific objectives.