MLLGAug 17, 2024

Time Series Analysis by State Space Learning

arXiv:2408.09120v1h-index: 14Has Code
Originality Highly original
AI Analysis

This work addresses inefficiencies in time series forecasting and component extraction for big-data applications, offering a novel paradigm that extends to engineering domains beyond time series.

The paper tackles the limitations of traditional state-space models in time series analysis, such as Gaussian assumptions and inefficient subset selection for explanatory variables, by introducing the State Space Learning (SSL) framework, which demonstrates superior subset selection accuracy and outperforms traditional models on a dataset of 48,000 monthly time series from the M4 competition.

Time series analysis by state-space models is widely used in forecasting and extracting unobservable components like level, slope, and seasonality, along with explanatory variables. However, their reliance on traditional Kalman filtering frequently hampers their effectiveness, primarily due to Gaussian assumptions and the absence of efficient subset selection methods to accommodate the multitude of potential explanatory variables in today's big-data applications. Our research introduces the State Space Learning (SSL), a novel framework and paradigm that leverages the capabilities of statistical learning to construct a comprehensive framework for time series modeling and forecasting. By utilizing a regularized high-dimensional regression framework, our approach jointly extracts typical time series unobservable components, detects and addresses outliers, and selects the influence of exogenous variables within a high-dimensional space in polynomial time and global optimality guarantees. Through a controlled numerical experiment, we demonstrate the superiority of our approach in terms of subset selection of explanatory variables accuracy compared to relevant benchmarks. We also present an intuitive forecasting scheme and showcase superior performances relative to traditional time series models using a dataset of 48,000 monthly time series from the M4 competition. We extend the applicability of our approach to reformulate any linear state space formulation featuring time-varying coefficients into high-dimensional regularized regressions, expanding the impact of our research to other engineering applications beyond time series analysis. Finally, our proposed methodology is implemented within the Julia open-source package, ``StateSpaceLearning.jl".

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes