LGNEMay 6, 2025

Call for Action: towards the next generation of symbolic regression benchmark

arXiv:2505.03977v111 citationsh-index: 8GECCO Companion
Originality Synthesis-oriented
AI Analysis

This work addresses benchmarking challenges for symbolic regression researchers, though it is incremental as it builds on an existing benchmark.

The authors updated SRBench, a symbolic regression benchmark, by doubling the number of evaluated methods, refining metrics, and analyzing trade-offs between model complexity, accuracy, and energy consumption, finding no single algorithm dominates across all datasets.

Symbolic Regression (SR) is a powerful technique for discovering interpretable mathematical expressions. However, benchmarking SR methods remains challenging due to the diversity of algorithms, datasets, and evaluation criteria. In this work, we present an updated version of SRBench. Our benchmark expands the previous one by nearly doubling the number of evaluated methods, refining evaluation metrics, and using improved visualizations of the results to understand the performances. Additionally, we analyze trade-offs between model complexity, accuracy, and energy consumption. Our results show that no single algorithm dominates across all datasets. We propose a call for action from SR community in maintaining and evolving SRBench as a living benchmark that reflects the state-of-the-art in symbolic regression, by standardizing hyperparameter tuning, execution constraints, and computational resource allocation. We also propose deprecation criteria to maintain the benchmark's relevance and discuss best practices for improving SR algorithms, such as adaptive hyperparameter tuning and energy-efficient implementations.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes