Detecting British Columbia Coastal Rainfall Patterns by Clustering Gaussian Processes
This work addresses the problem of exploratory cluster analysis for coastal rainfall data, which is incremental in applying Gaussian processes to this specific domain.
The paper tackled the problem of clustering coastal rainfall patterns in British Columbia using Gaussian processes, developing a method based on covariance kernel similarity to identify clusters that relate to El Niño and La Niña events, though not as simple binary groupings.
Functional data analysis is a statistical framework where data are assumed to follow some functional form. This method of analysis is commonly applied to time series data, where time, measured continuously or in discrete intervals, serves as the location for a function's value. Gaussian processes are a generalization of the multivariate normal distribution to function space and, in this paper, they are used to shed light on coastal rainfall patterns in British Columbia (BC). Specifically, this work addressed the question over how one should carry out an exploratory cluster analysis for the BC, or any similar, coastal rainfall data. An approach is developed for clustering multiple processes observed on a comparable interval, based on how similar their underlying covariance kernel is. This approach provides interesting insights into the BC data, and these insights can be framed in terms of El Niño and La Niña; however, the result is not simply one cluster representing El Niño years and another for La Niña years. From one perspective, the results show that clustering annual rainfall can potentially be used to identify extreme weather patterns.