LG · Sep 25, 2023

Detach-ROCKET: Sequential feature selection for time series classification with random convolutional kernels

Gonzalo Uribarri, Federico Barone, Alessio Ansuini, Erik Fransén

arXiv:2309.14518v39.814 citations · h-index: 7 · Has Code

Originality: Incremental advance

AI Analysis

This work addresses computational efficiency and interpretability for researchers and practitioners in time series classification, though it is incremental as it builds on existing ROCKET methods.

The paper tackled the problem of redundant features in ROCKET-based time series classification models, which increase computational load and reduce generalization, by introducing Sequential Feature Detachment (SFD) to prune non-essential features. The result was Detach-ROCKET, which on the largest binary UCR dataset improved test accuracy by 0.6% while reducing features by 98.9%.

Time Series Classification (TSC) is essential in fields like medicine, environmental science, and finance, enabling tasks such as disease diagnosis, anomaly detection, and stock price analysis. While machine learning models like Recurrent Neural Networks and InceptionTime are successful in numerous applications, they can face scalability issues due to computational requirements. Recently, ROCKET has emerged as an efficient alternative, achieving state-of-the-art performance and simplifying training by utilizing a large number of randomly generated features from the time series data. However, many of these features are redundant or non-informative, increasing computational load and compromising generalization. Here we introduce Sequential Feature Detachment (SFD) to identify and prune non-essential features in ROCKET-based models, such as ROCKET, MiniRocket, and MultiRocket. SFD estimates feature importance using model coefficients and can handle large feature sets without complex hyperparameter tuning. Testing on the UCR archive shows that SFD can produce models with better test accuracy using only 10% of the original features. We named these pruned models Detach-ROCKET. We also present an end-to-end procedure for determining an optimal balance between the number of features and model accuracy. On the largest binary UCR dataset, Detach-ROCKET improves test accuracy by 0.6% while reducing features by 98.9%. By enabling a significant reduction in model size without sacrificing accuracy, our methodology improves computational efficiency and contributes to model interpretability. We believe that Detach-ROCKET will be a valuable tool for researchers and practitioners working with time series data, who can find a user-friendly implementation of the model at https://github.com/gon-uri/detach_rocket.
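
To make the idea of coefficient-based pruning concrete, here is a minimal sketch in Python. It is not the authors' implementation (that lives in the repository linked above) and makes several assumptions: a closed-form ridge regression on ±1 labels stands in for the linear classifier, random Gaussian columns stand in for ROCKET features, and the names `fit_ridge`, `sequential_feature_detachment`, `drop_fraction`, and `n_steps` are hypothetical. The loop fits the linear model, ranks features by the absolute value of their coefficients, detaches the lowest-ranked fraction, and refits on what remains.

```python
import numpy as np


def fit_ridge(X, y, alpha=1.0):
    """Closed-form ridge regression on +/-1 targets (assumed stand-in classifier)."""
    n_features = X.shape[1]
    A = X.T @ X + alpha * np.eye(n_features)
    return np.linalg.solve(A, X.T @ y)


def sequential_feature_detachment(X, y, drop_fraction=0.1, n_steps=10, alpha=1.0):
    """Repeatedly drop the features with the smallest |coefficient| and refit.

    Returns a list of (active_indices, weights) pairs, one per fitted model.
    All names and defaults here are illustrative assumptions.
    """
    active = np.arange(X.shape[1])
    history = []
    for _ in range(n_steps):
        w = fit_ridge(X[:, active], y, alpha)
        history.append((active.copy(), w))
        n_drop = int(np.floor(drop_fraction * active.size))
        if n_drop == 0:
            break  # nothing left to detach at this fraction
        # Positions (within `active`) of the features with the largest |w|.
        keep = np.argsort(np.abs(w))[n_drop:]
        active = np.sort(active[keep])
    return history


# Synthetic stand-in for a feature matrix: only the first 5 columns carry signal.
rng = np.random.default_rng(0)
X = rng.standard_normal((500, 50))
y = np.sign(X[:, :5].sum(axis=1))

history = sequential_feature_detachment(X, y, drop_fraction=0.5, n_steps=4)
print([idx.size for idx, _ in history])  # [50, 25, 13, 7]
print(history[-1][0])                    # indices still active in the last model
```

In practice one would evaluate each model in `history` on held-out data and pick the step that best balances accuracy against the number of retained features; the selection criterion itself is not specified in the text above, so it is left out of this sketch.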
