AIFeb 14, 2012

Improving the Scalability of Optimal Bayesian Network Learning with External-Memory Frontier Breadth-First Branch and Bound Search

arXiv:1202.3744v144 citations
Originality Incremental advance
AI Analysis

This work addresses scalability issues in Bayesian network structure learning for researchers and practitioners dealing with large datasets, representing an incremental improvement over existing A*-based methods.

The paper tackles the problem of learning the optimal structure of a Bayesian network by developing a memory-efficient heuristic search algorithm that uses frontier breadth-first branch and bound search and external memory storage, enabling it to solve significantly larger problems than the current state of the art.

Previous work has shown that the problem of learning the optimal structure of a Bayesian network can be formulated as a shortest path finding problem in a graph and solved using A* search. In this paper, we improve the scalability of this approach by developing a memory-efficient heuristic search algorithm for learning the structure of a Bayesian network. Instead of using A*, we propose a frontier breadth-first branch and bound search that leverages the layered structure of the search graph of this problem so that no more than two layers of the graph, plus solution reconstruction information, need to be stored in memory at a time. To further improve scalability, the algorithm stores most of the graph in external memory, such as hard disk, when it does not fit in RAM. Experimental results show that the resulting algorithm solves significantly larger problems than the current state of the art.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes