LGSINov 20, 2023

Node Classification in Random Trees

arXiv:2311.12167v2h-index: 10
Originality Incremental advance
AI Analysis

This addresses the challenge of node classification in random trees for applications like natural language processing, though it is incremental as it builds on existing graph neural network and Markov network approaches.

The paper tackles the problem of modeling joint distributions of node label assignments in random trees without predetermined topology or observed labels, and demonstrates that their method outperforms baselines on the Stanford Sentiment Treebank dataset.

We propose a method for the classification of objects that are structured as random trees. Our aim is to model a distribution over the node label assignments in settings where the tree data structure is associated with node attributes (typically high dimensional embeddings). The tree topology is not predetermined and none of the label assignments are present during inference. Other methods that produce a distribution over node label assignment in trees (or more generally in graphs) either assume conditional independence of the label assignment, operate on a fixed graph topology, or require part of the node labels to be observed. Our method defines a Markov Network with the corresponding topology of the random tree and an associated Gibbs distribution. We parameterize the Gibbs distribution with a Graph Neural Network that operates on the random tree and the node embeddings. This allows us to estimate the likelihood of node assignments for a given random tree and use MCMC to sample from the distribution of node assignments. We evaluate our method on the tasks of node classification in trees on the Stanford Sentiment Treebank dataset. Our method outperforms the baselines on this dataset, demonstrating its effectiveness for modeling joint distributions of node labels in random trees.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes