CLOct 29, 2023

PACuna: Automated Fine-Tuning of Language Models for Particle Accelerators

Antonin Sulc, Raimund Kammering, Annika Eichler, Tim Wilksen

arXiv:2310.19106v31.33 citationsh-index: 28Has Code

Originality Incremental advance

AI Analysis

This provides an intelligent assistant for particle accelerator facilities to address domain-specific questions that commercial assistants cannot handle.

The researchers tackled the challenge of navigating complex particle accelerator information by introducing PACuna, a fine-tuned language model that demonstrates proficiency in answering intricate accelerator questions, validated by experts.

Navigating the landscape of particle accelerators has become increasingly challenging with recent surges in contributions. These intricate devices challenge comprehension, even within individual facilities. To address this, we introduce PACuna, a fine-tuned language model refined through publicly available accelerator resources like conferences, pre-prints, and books. We automated data collection and question generation to minimize expert involvement and make the data publicly available. PACuna demonstrates proficiency in addressing intricate accelerator questions, validated by experts. Our approach shows adapting language models to scientific domains by fine-tuning technical texts and auto-generated corpora capturing the latest developments can further produce pre-trained models to answer some intricate questions that commercially available assistants cannot and can serve as intelligent assistants for individual facilities.

View on arXiv PDF Code

Similar