OCLGFeb 16, 2021

Learning Symbolic Expressions: Mixed-Integer Formulations, Cuts, and Heuristics

arXiv:2102.08351v26 citations
AI Analysis

This work addresses the problem of learning regression functions without predefined forms for researchers and practitioners in machine learning, representing an incremental improvement with new cuts and heuristics.

The authors tackled symbolic regression by formulating it as a mixed-integer nonlinear program (MINLP) and introduced new cuts and a heuristic to improve its solution, achieving competitive results compared to existing methods in computational experiments.

In this paper we consider the problem of learning a regression function without assuming its functional form. This problem is referred to as symbolic regression. An expression tree is typically used to represent a solution function, which is determined by assigning operators and operands to the nodes. The symbolic regression problem can be formulated as a nonconvex mixed-integer nonlinear program (MINLP), where binary variables are used to assign operators and nonlinear expressions are used to propagate data values through nonlinear operators such as square, square root, and exponential. We extend this formulation by adding new cuts that improve the solution of this challenging MINLP. We also propose a heuristic that iteratively builds an expression tree by solving a restricted MINLP. We perform computational experiments and compare our approach with a mixed-integer program-based method and a neural-network-based method from the literature.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes