MLLGJun 10, 2019

Goodness-of-fit Test for Latent Block Models

arXiv:1906.03886v75 citations
Originality Incremental advance
AI Analysis

This provides a statistical test for biclustering in relational data, addressing a gap in latent block models, but it is incremental as it extends methods from stochastic block models.

The authors tackled the problem of determining row and column cluster numbers in latent block models, which lacked a statistical test method, by developing a new goodness-of-fit test that uses random matrix theory and experimentally demonstrated its effectiveness with asymptotic behavior and accuracy measurements.

Latent block models are used for probabilistic biclustering, which is shown to be an effective method for analyzing various relational data sets. However, there has been no statistical test method for determining the row and column cluster numbers of latent block models. Recent studies have constructed statistical-test-based methods for stochastic block models, which assume that the observed matrix is a square symmetric matrix and that the cluster assignments are the same for rows and columns. In this study, we developed a new goodness-of-fit test for latent block models to test whether an observed data matrix fits a given set of row and column cluster numbers, or it consists of more clusters in at least one direction of the row and the column. To construct the test method, we used a result from the random matrix theory for a sample covariance matrix. We experimentally demonstrated the effectiveness of the proposed method by showing the asymptotic behavior of the test statistic and measuring the test accuracy.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes