LG MLJul 4, 2012

Maximum Margin Bayesian Networks

Yuhong Guo, Dana Wilkinson, Dale Schuurmans

arXiv:1207.1382v132 citations

Originality Incremental advance

AI Analysis

This work addresses the challenge of integrating prior causal knowledge into discriminative learning for Bayesian networks, though it is incremental as it builds on existing maximum margin methods.

The authors tackled the problem of learning Bayesian network classifiers that maximize the margin, which is harder due to normalization constraints compared to undirected models like Markov networks. They developed an effective training algorithm that shows improved generalization performance over Markov networks when directed structure encodes relevant knowledge, with experimental results indicating this benefit.

We consider the problem of learning Bayesian network classifiers that maximize the marginover a set of classification variables. We find that this problem is harder for Bayesian networks than for undirected graphical models like maximum margin Markov networks. The main difficulty is that the parameters in a Bayesian network must satisfy additional normalization constraints that an undirected graphical model need not respect. These additional constraints complicate the optimization task. Nevertheless, we derive an effective training algorithm that solves the maximum margin training problem for a range of Bayesian network topologies, and converges to an approximate solution for arbitrary network topologies. Experimental results show that the method can demonstrate improved generalization performance over Markov networks when the directed graphical structure encodes relevant knowledge. In practice, the training technique allows one to combine prior knowledge expressed as a directed (causal) model with state of the art discriminative learning methods.

View on arXiv PDF

Similar