DATA-ANHEP-EXMLAug 20, 2016

Reweighting with Boosted Decision Trees

arXiv:1608.05806v1146 citations
Originality Synthesis-oriented
AI Analysis

This addresses data correction issues for high energy physics experiments, but it appears incremental as it builds on existing reweighting approaches.

The paper tackles the problem of disagreement between Monte Carlo simulated and observed data in high energy physics by introducing a novel reweighting method using boosted decision trees, and it discusses quality assessment for this step.

Machine learning tools are commonly used in modern high energy physics (HEP) experiments. Different models, such as boosted decision trees (BDT) and artificial neural networks (ANN), are widely used in analyses and even in the software triggers. In most cases, these are classification models used to select the "signal" events from data. Monte Carlo simulated events typically take part in training of these models. While the results of the simulation are expected to be close to real data, in practical cases there is notable disagreement between simulated and observed data. In order to use available simulation in training, corrections must be introduced to generated data. One common approach is reweighting - assigning weights to the simulated events. We present a novel method of event reweighting based on boosted decision trees. The problem of checking the quality of reweighting step in analyses is also discussed.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes