MLLGMay 24, 2013

Characterizing A Database of Sequential Behaviors with Latent Dirichlet Hidden Markov Models

arXiv:1305.5734v1
Originality Incremental advance
AI Analysis

This work addresses sequence modeling for researchers in machine learning, but it is incremental as it builds on existing HMM and latent Dirichlet methods.

The paper tackles the problem of modeling sequential behaviors in databases by proposing a latent Dirichlet hidden Markov model (LDHMM), which improves generalization performance with better log-likelihood and competitive results in sequence classification on real-world datasets.

This paper proposes a generative model, the latent Dirichlet hidden Markov models (LDHMM), for characterizing a database of sequential behaviors (sequences). LDHMMs posit that each sequence is generated by an underlying Markov chain process, which are controlled by the corresponding parameters (i.e., the initial state vector, transition matrix and the emission matrix). These sequence-level latent parameters for each sequence are modeled as latent Dirichlet random variables and parameterized by a set of deterministic database-level hyper-parameters. Through this way, we expect to model the sequence in two levels: the database level by deterministic hyper-parameters and the sequence-level by latent parameters. To learn the deterministic hyper-parameters and approximate posteriors of parameters in LDHMMs, we propose an iterative algorithm under the variational EM framework, which consists of E and M steps. We examine two different schemes, the fully-factorized and partially-factorized forms, for the framework, based on different assumptions. We present empirical results of behavior modeling and sequence classification on three real-world data sets, and compare them to other related models. The experimental results prove that the proposed LDHMMs produce better generalization performance in terms of log-likelihood and deliver competitive results on the sequence classification problem.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes