MLJul 17, 2015

Incremental Variational Inference for Latent Dirichlet Allocation

arXiv:1507.05016v24 citations
Originality Incremental advance
AI Analysis

This work addresses scalability issues in topic modeling for large datasets, offering an incremental method that is faster and more stable, though it is incremental/hybrid in nature.

The paper tackles the problem of scaling latent Dirichlet allocation (LDA) to massive document collections by introducing incremental variational inference, which processes data efficiently without requiring a learning rate, converges faster, and monotonically increases the variational bound, with the distributed version achieving comparable performance and significant speed-up.

We introduce incremental variational inference and apply it to latent Dirichlet allocation (LDA). Incremental variational inference is inspired by incremental EM and provides an alternative to stochastic variational inference. Incremental LDA can process massive document collections, does not require to set a learning rate, converges faster to a local optimum of the variational bound and enjoys the attractive property of monotonically increasing it. We study the performance of incremental LDA on large benchmark data sets. We further introduce a stochastic approximation of incremental variational inference which extends to the asynchronous distributed setting. The resulting distributed algorithm achieves comparable performance as single host incremental variational inference, but with a significant speed-up.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes