HEP-PH LG HEP-EX DATA-AN

Aug 5, 2024

KAN we improve on HEP classification tasks? Kolmogorov-Arnold Networks applied to an LHC physics example

Johannes Erdmann, Florian Mausolf, Jan Lukas Späh

arXiv:2408.02743v2 · 7 citations · h-index: 10

Originality: Synthesis-oriented

AI Analysis

This is an incremental study applying a new neural network type to a domain-specific physics classification problem.

The paper applied Kolmogorov-Arnold Networks (KANs) to a binary event classification task in high-energy physics, finding that small KANs offered interpretability advantages with only moderate performance loss compared to multilayer perceptrons, but did not improve parameter efficiency.

Recently, Kolmogorov-Arnold Networks (KANs) have been proposed as an alternative to multilayer perceptrons, suggesting advantages in performance and interpretability. We study a typical binary event classification task in high-energy physics including high-level features and comment on the performance and interpretability of KANs in this context. Consistent with expectations, we find that the learned activation functions of a one-layer KAN resemble the univariate log-likelihood ratios of the respective input features. In deeper KANs, the activations in the first layer differ from those in the one-layer KAN, which indicates that the deeper KANs learn more complex representations of the data, a pattern commonly observed in other deep-learning architectures. We study KANs with different depths and widths and we compare them to multilayer perceptrons in terms of performance and number of trainable parameters. For the chosen classification task, we do not find that KANs are more parameter efficient. However, small KANs may offer advantages in terms of interpretability that come at the cost of only a moderate loss in performance.
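The link between a one-layer KAN and univariate log-likelihood ratios can be illustrated with a toy example. A one-layer KAN sums one learned univariate function per input feature, and for statistically independent features the optimal classifier logit is, up to a constant, the sum of the per-feature log-likelihood ratios. The sketch below is not the paper's dataset or code: the Gaussian toy features, the binning, and the helper name `univariate_llr` are all assumptions made for illustration. It estimates the log-likelihood ratio of a single feature from histograms and compares it with the analytic result for unit-width Gaussians.

```python
import numpy as np


def univariate_llr(x_sig, x_bkg, edges, eps=1e-12):
    """Binned estimate of log[p(x|signal) / p(x|background)] for one feature.

    Hypothetical helper for illustration; densities are normalised to the
    full sample size so that events outside the binned range still count.
    """
    widths = np.diff(edges)
    n_sig, _ = np.histogram(x_sig, bins=edges)
    n_bkg, _ = np.histogram(x_bkg, bins=edges)
    p_sig = n_sig / (len(x_sig) * widths)
    p_bkg = n_bkg / (len(x_bkg) * widths)
    return np.log((p_sig + eps) / (p_bkg + eps))


# Toy data (assumption): two independent unit-width Gaussian features,
# with the signal means shifted relative to the background means.
rng = np.random.default_rng(0)
n_events = 200_000
sig = rng.normal(loc=[1.0, 0.5], scale=1.0, size=(n_events, 2))
bkg = rng.normal(loc=[0.0, 0.0], scale=1.0, size=(n_events, 2))

edges = np.linspace(-1.5, 2.5, 21)          # 20 bins of width 0.2
centers = 0.5 * (edges[:-1] + edges[1:])

# Histogram-based log-likelihood ratio of the first feature.
llr_0 = univariate_llr(sig[:, 0], bkg[:, 0], edges)

# For unit-width Gaussians with means mu and 0, the exact ratio is
# log N(x; mu, 1) - log N(x; 0, 1) = mu * x - mu**2 / 2.
mu_0 = 1.0
analytic_0 = mu_0 * centers - 0.5 * mu_0**2

# Largest deviation between the binned estimate and the analytic curve.
max_dev = float(np.max(np.abs(llr_0 - analytic_0)))
```

In this toy setting the binned estimate follows the straight line `mu * x - mu**2 / 2` to within statistical fluctuations. A learned first-layer activation of a one-layer KAN would be compared against such a curve feature by feature; for correlated features the correspondence is only approximate.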
