LGDCSIMLSep 3, 2013

Online Tensor Methods for Learning Latent Variable Models

arXiv:1309.0787v555 citations
Originality Incremental advance
AI Analysis

This work provides a computationally efficient solution for latent variable modeling in social networks and text analysis, though it is incremental as it builds on existing tensor methods.

The authors tackled community detection and topic modeling by developing an online tensor decomposition method using stochastic gradient descent, which achieved improved accuracy and execution times several orders of magnitude faster than state-of-the-art algorithms on datasets like Facebook and New York Times.

We introduce an online tensor decomposition based approach for two latent variable modeling problems namely, (1) community detection, in which we learn the latent communities that the social actors in social networks belong to, and (2) topic modeling, in which we infer hidden topics of text articles. We consider decomposition of moment tensors using stochastic gradient descent. We conduct optimization of multilinear operations in SGD and avoid directly forming the tensors, to save computational and storage costs. We present optimized algorithm in two platforms. Our GPU-based implementation exploits the parallelism of SIMD architectures to allow for maximum speed-up by a careful optimization of storage and data transfer, whereas our CPU-based implementation uses efficient sparse matrix computations and is suitable for large sparse datasets. For the community detection problem, we demonstrate accuracy and computational efficiency on Facebook, Yelp and DBLP datasets, and for the topic modeling problem, we also demonstrate good performance on the New York Times dataset. We compare our results to the state-of-the-art algorithms such as the variational method, and report a gain of accuracy and a gain of several orders of magnitude in the execution time.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes