SEHEP-EXJul 7, 2020

The CMS monitoring infrastructure and applications

arXiv:2007.03630v119 citationsHas Code
Originality Synthesis-oriented
AI Analysis

This work addresses the critical need for efficient operation and performance evaluation of the CMS distributed computing infrastructure, which is essential for processing large-scale physics data, but it is incremental as it builds on existing monitoring solutions tailored to specific experiment needs.

The paper tackles the challenge of monitoring the globally distributed computing infrastructure for the CMS experiment at CERN, which handles multi-petabyte datasets, by presenting a scalable, open-source architecture for real-time and historical monitoring of subsystems like workload and data management.

The globally distributed computing infrastructure required to cope with the multi-petabytes datasets produced by the Compact Muon Solenoid (CMS) experiment at the Large Hadron Collider (LHC) at CERN comprises several subsystems, such as workload management, data management, data transfers, and submission of users' and centrally managed production requests. The performance and status of all subsystems must be constantly monitored to guarantee the efficient operation of the whole infrastructure. Moreover, key metrics need to be tracked to evaluate and study the system performance over time. The CMS monitoring architecture allows both real-time and historical monitoring of a variety of data sources and is based on scalable and open source solutions tailored to satisfy the experiment's monitoring needs. We present the monitoring data flow and software architecture for the CMS distributed computing applications. We discuss the challenges, components, current achievements, and future developments of the CMS monitoring infrastructure.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes