LGAug 29, 2024

Data Quality Monitoring for the Hadron Calorimeters Using Transfer Learning for Anomaly Detection

Mulugeta Weldezgina Asres, Christian Walter Omlin, Long Wang, Pavel Parygin, David Yu, Jay Dittmann, The CMS-HCAL Collaboration

arXiv:2408.16612v36.43 citationsh-index: 109Has Code

Originality Incremental advance

AI Analysis

This work addresses data curation and anomaly detection in high-dimensional spatio-temporal sensor systems, such as particle physics experiments, but is incremental as it adapts existing transfer learning methods to a specific domain.

The study tackled the challenge of data quality monitoring for the Hadron Calorimeters at CERN by applying transfer learning with a hybrid autoencoder for anomaly detection, achieving improved model accuracy and robustness while reducing trainable parameters and mitigating data contamination effects.

The proliferation of sensors brings an immense volume of spatio-temporal (ST) data in many domains, including monitoring, diagnostics, and prognostics applications. Data curation is a time-consuming process for a large volume of data, making it challenging and expensive to deploy data analytics platforms in new environments. Transfer learning (TL) mechanisms promise to mitigate data sparsity and model complexity by utilizing pre-trained models for a new task. Despite the triumph of TL in fields like computer vision and natural language processing, efforts on complex ST models for anomaly detection (AD) applications are limited. In this study, we present the potential of TL within the context of high-dimensional ST AD with a hybrid autoencoder architecture, incorporating convolutional, graph, and recurrent neural networks. Motivated by the need for improved model accuracy and robustness, particularly in scenarios with limited training data on systems with thousands of sensors, this research investigates the transferability of models trained on different sections of the Hadron Calorimeter of the Compact Muon Solenoid experiment at CERN. The key contributions of the study include exploring TL's potential and limitations within the context of encoder and decoder networks, revealing insights into model initialization and training configurations that enhance performance while substantially reducing trainable parameters and mitigating data contamination effects. Code: https://github.com/muleina/CMS\_HCAL\_ML\_OnlineDQM .

View on arXiv PDF Code

Similar