DSLGPRApr 22, 2020

Hardness of Identity Testing for Restricted Boltzmann Machines and Potts models

arXiv:2004.10805v15 citations
Originality Incremental advance
AI Analysis

This addresses computational hardness in statistical inference for undirected graphical models, with implications for machine learning and theoretical computer science, though it is incremental as it extends prior results from Ising models to RBMs and Potts models.

The paper tackles the problem of identity testing for restricted Boltzmann machines (RBMs) and Potts models, proving that unless RP=NP, there is no polynomial-time algorithm when βd=ω(log n), while providing an efficient algorithm when βd=O(log n).

We study identity testing for restricted Boltzmann machines (RBMs), and more generally for undirected graphical models. Given sample access to the Gibbs distribution corresponding to an unknown or hidden model $M^*$ and given an explicit model $M$, can we distinguish if either $M = M^*$ or if they are (statistically) far apart? Daskalakis et al. (2018) presented a polynomial-time algorithm for identity testing for the ferromagnetic (attractive) Ising model. In contrast, for the antiferromagnetic (repulsive) Ising model, Bezáková et al. (2019) proved that unless $RP=NP$ there is no identity testing algorithm when $βd=ω(\log{n})$, where $d$ is the maximum degree of the visible graph and $β$ is the largest edge weight in absolute value. We prove analogous hardness results for RBMs (i.e., mixed Ising models on bipartite graphs), even when there are no latent variables or an external field. Specifically, we show that if $RP \neq NP$, then when $βd=ω(\log{n})$ there is no polynomial-time algorithm for identity testing for RBMs; when $βd =O(\log{n})$ there is an efficient identity testing algorithm that utilizes the structure learning algorithm of Klivans and Meka (2017). In addition, we prove similar lower bounds for purely ferromagnetic RBMs with inconsistent external fields, and for the ferromagnetic Potts model. Previous hardness results for identity testing of Bezáková et al. (2019) utilized the hardness of finding the maximum cuts, which corresponds to the ground states of the antiferromagnetic Ising model. Since RBMs are on bipartite graphs such an approach is not feasible. We instead introduce a general methodology to reduce from the corresponding approximate counting problem and utilize the phase transition that is exhibited by RBMs and the mean-field Potts model.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes