LGDSMLJul 11, 2012

PAC-learning bounded tree-width Graphical Models

arXiv:1207.4151v177 citations
Originality Highly original
AI Analysis

This provides an efficient solution for learning complex probabilistic models in machine learning, addressing a previously NP-complete bottleneck for tree-width > 1.

The authors tackled the problem of efficiently learning bounded tree-width graphical models, achieving a polynomial-time PAC-learning algorithm that requires only a polynomial number of samples.

We show that the class of strongly connected graphical models with treewidth at most k can be properly efficiently PAC-learnt with respect to the Kullback-Leibler Divergence. Previous approaches to this problem, such as those of Chow ([1]), and Ho gen ([7]) have shown that this class is PAC-learnable by reducing it to a combinatorial optimization problem. However, for k > 1, this problem is NP-complete ([15]), and so unless P=NP, these approaches will take exponential amounts of time. Our approach differs significantly from these, in that it first attempts to find approximate conditional independencies by solving (polynomially many) submodular optimization problems, and then using a dynamic programming formulation to combine the approximate conditional independence information to derive a graphical model with underlying graph of the tree-width specified. This gives us an efficient (polynomial time in the number of random variables) PAC-learning algorithm which requires only polynomial number of samples of the true distribution, and only polynomial running time.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes