AIDBJan 27, 2017

Efficiently Summarising Event Sequences with Rich Interleaving Patterns

arXiv:1701.08096v125 citations
AI Analysis

This addresses the challenge of efficiently mining key structures in databases for data analysis, though it appears incremental as it builds on existing pattern set mining with richer patterns.

The paper tackles the problem of summarizing sequential data by discovering a small set of interleaving patterns to describe the data well, resulting in a method that is orders of magnitude faster than the state of the art and produces better models.

Discovering the key structure of a database is one of the main goals of data mining. In pattern set mining we do so by discovering a small set of patterns that together describe the data well. The richer the class of patterns we consider, and the more powerful our description language, the better we will be able to summarise the data. In this paper we propose \ourmethod, a novel greedy MDL-based method for summarising sequential data using rich patterns that are allowed to interleave. Experiments show \ourmethod is orders of magnitude faster than the state of the art, results in better models, as well as discovers meaningful semantics in the form patterns that identify multiple choices of values.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes