LG NEMay 6, 2025

Call for Action: towards the next generation of symbolic regression benchmark

Guilherme S. Imai Aldeia, Hengzhe Zhang, Geoffrey Bomarito, Miles Cranmer, Alcides Fonseca, Bogdan Burlacu, William G. La Cava, Fabrício Olivetti de França

arXiv:2505.03977v119.713 citationsh-index: 8GECCO Companion

Originality Synthesis-oriented

AI Analysis

This work addresses benchmarking challenges for symbolic regression researchers, though it is incremental as it builds on an existing benchmark.

The authors updated SRBench, a symbolic regression benchmark, by doubling the number of evaluated methods, refining metrics, and analyzing trade-offs between model complexity, accuracy, and energy consumption, finding no single algorithm dominates across all datasets.

Symbolic Regression (SR) is a powerful technique for discovering interpretable mathematical expressions. However, benchmarking SR methods remains challenging due to the diversity of algorithms, datasets, and evaluation criteria. In this work, we present an updated version of SRBench. Our benchmark expands the previous one by nearly doubling the number of evaluated methods, refining evaluation metrics, and using improved visualizations of the results to understand the performances. Additionally, we analyze trade-offs between model complexity, accuracy, and energy consumption. Our results show that no single algorithm dominates across all datasets. We propose a call for action from SR community in maintaining and evolving SRBench as a living benchmark that reflects the state-of-the-art in symbolic regression, by standardizing hyperparameter tuning, execution constraints, and computational resource allocation. We also propose deprecation criteria to maintain the benchmark's relevance and discuss best practices for improving SR algorithms, such as adaptive hyperparameter tuning and energy-efficient implementations.

View on arXiv PDF

Similar