Multivariate Time Series Clustering for Environmental State Characterization of Ground-Based Gravitational-Wave Detectors
This work addresses a domain-specific problem for gravitational-wave detector operators by providing an automated tool to reduce manual monitoring burden and improve diagnostics, though it is incremental as it applies existing clustering methods to new data.
The paper tackles the problem of manually monitoring multiple seismic data streams in gravitational-wave detectors like LIGO, which is prone to human error, by developing an end-to-end machine learning pipeline for multivariate time series clustering to characterize environmental states and correlate clusters with detector events, achieving actionable insights for operators.
Gravitational-wave observatories like LIGO are large-scale, terrestrial instruments housed in infrastructure that spans a multi-kilometer geographic area and which must be actively controlled to maintain operational stability for long observation periods. Despite exquisite seismic isolation, they remain susceptible to seismic noise and other terrestrial disturbances that can couple undesirable vibrations into the instrumental infrastructure, potentially leading to control instabilities or noise artifacts in the detector output. It is, therefore, critical to characterize the seismic state of these observatories to identify a set of temporal patterns that can inform the detector operators in day-to-day monitoring and diagnostics. On a day-to-day basis, the operators monitor several seismically relevant data streams to diagnose operational instabilities and sources of noise using some simple empirically-determined thresholds. It can be untenable for a human operator to monitor multiple data streams in this manual fashion and thus a distillation of these data-streams into a more human-friendly format is sought. In this paper, we present an end-to-end machine learning pipeline for features-based multivariate time series clustering to achieve this goal and to provide actionable insights to the detector operators by correlating found clusters with events of interest in the detector.