COMP-PHLGAug 31, 2022

Learning Tree Structures from Leaves For Particle Decay Reconstruction

arXiv:2208.14924v211 citationsh-index: 109
Originality Incremental advance
AI Analysis

This addresses the challenge of reconstructing particle decay trees from observable final products in high-energy physics, where large combinatorial spaces make analytic solutions intractable, representing a domain-specific advancement.

The paper tackles the problem of reconstructing hierarchical tree structures from leaf data alone, introducing the LCAG matrix representation to enable end-to-end learning without prior assumptions, achieving up to 92.5% accuracy for trees with up to 6 leaves and 59.7% for up to 10 leaves in particle decay simulations.

In this work, we present a neural approach to reconstructing rooted tree graphs describing hierarchical interactions, using a novel representation we term the Lowest Common Ancestor Generations (LCAG) matrix. This compact formulation is equivalent to the adjacency matrix, but enables learning a tree's structure from its leaves alone without the prior assumptions required if using the adjacency matrix directly. Employing the LCAG therefore enables the first end-to-end trainable solution which learns the hierarchical structure of varying tree sizes directly, using only the terminal tree leaves to do so. In the case of high-energy particle physics, a particle decay forms a hierarchical tree structure of which only the final products can be observed experimentally, and the large combinatorial space of possible trees makes an analytic solution intractable. We demonstrate the use of the LCAG as a target in the task of predicting simulated particle physics decay structures using both a Transformer encoder and a Neural Relational Inference encoder Graph Neural Network. With this approach, we are able to correctly predict the LCAG purely from leaf features for a maximum tree-depth of $8$ in $92.5\%$ of cases for trees up to $6$ leaves (including) and $59.7\%$ for trees up to $10$ in our simulated dataset.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes