LG HEP-EX INS-DETJan 18, 2024

SymbolNet: Neural Symbolic Regression with Adaptive Dynamic Pruning for Compression

Ho Fung Tsoi, Vladimir Loncar, Sridhara Dasu, Philip Harris

arXiv:2401.09949v313.415 citationsHas CodeMachine Learning: Science and Technology

Originality Incremental advance

AI Analysis

This enables low-latency inference on custom hardware like FPGAs for resource-constrained environments such as high-energy physics experiments, representing an incremental improvement over existing symbolic regression methods.

The paper tackles the challenge of finding compact symbolic expressions for high-dimensional datasets by proposing SymbolNet, a neural network approach to symbolic regression with adaptive dynamic pruning, which achieves effective compression while maintaining performance on tasks like LHC jet tagging (16 inputs), MNIST (784 inputs), and SVHN (3072 inputs).

Compact symbolic expressions have been shown to be more efficient than neural network models in terms of resource consumption and inference speed when implemented on custom hardware such as FPGAs, while maintaining comparable accuracy~\cite{tsoi2023symbolic}. These capabilities are highly valuable in environments with stringent computational resource constraints, such as high-energy physics experiments at the CERN Large Hadron Collider. However, finding compact expressions for high-dimensional datasets remains challenging due to the inherent limitations of genetic programming, the search algorithm of most symbolic regression methods. Contrary to genetic programming, the neural network approach to symbolic regression offers scalability to high-dimensional inputs and leverages gradient methods for faster equation searching. Common ways of constraining expression complexity often involve multistage pruning with fine-tuning, which can result in significant performance loss. In this work, we propose $\tt{SymbolNet}$, a neural network approach to symbolic regression specifically designed as a model compression technique, aimed at enabling low-latency inference for high-dimensional inputs on custom hardware such as FPGAs. This framework allows dynamic pruning of model weights, input features, and mathematical operators in a single training process, where both training loss and expression complexity are optimized simultaneously. We introduce a sparsity regularization term for each pruning type, which can adaptively adjust its strength, leading to convergence at a target sparsity ratio. Unlike most existing symbolic regression methods that struggle with datasets containing more than $\mathcal{O}(10)$ inputs, we demonstrate the effectiveness of our model on the LHC jet tagging task (16 inputs), MNIST (784 inputs), and SVHN (3072 inputs).

View on arXiv PDF Code

Similar