LGITJun 8, 2015

Empirical Studies on Symbolic Aggregation Approximation Under Statistical Perspectives for Knowledge Discovery in Time Series

arXiv:1506.02732v12 citations
Originality Synthesis-oriented
AI Analysis

This work provides an analytical framework for evaluating and improving symbolic representations in time series knowledge discovery, but it is incremental as it builds on the established SAX method.

The paper tackled the lack of empirical investigation into the statistical properties of Symbolic Aggregation approXimation (SAX) for time series analysis by applying and proposing statistical measurements, including a new information embedding cost (IEC) score, and found that SAX reduces complexity while preserving core information with significant embedding efficiency on benchmark and clinical datasets.

Symbolic Aggregation approXimation (SAX) has been the de facto standard representation methods for knowledge discovery in time series on a number of tasks and applications. So far, very little work has been done in empirically investigating the intrinsic properties and statistical mechanics in SAX words. In this paper, we applied several statistical measurements and proposed a new statistical measurement, i.e. information embedding cost (IEC) to analyze the statistical behaviors of the symbolic dynamics. Our experiments on the benchmark datasets and the clinical signals demonstrate that SAX can always reduce the complexity while preserving the core information embedded in the original time series with significant embedding efficiency. Our proposed IEC score provide a priori to determine if SAX is adequate for specific dataset, which can be generalized to evaluate other symbolic representations. Our work provides an analytical framework with several statistical tools to analyze, evaluate and further improve the symbolic dynamics for knowledge discovery in time series.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes