LG AIMar 16

In-Context Symbolic Regression for Robustness-Improved Kolmogorov-Arnold Networks

Francesco Sovrano, Lidia Losavio, Giulia Vilone, Marc Langheinrich

arXiv:2603.1525010.4h-index: 6

Predicted impact top 85% in LG · last 90 daysOriginality Incremental advance

AI Analysis

This addresses the bottleneck of extracting interpretable symbolic expressions from KANs for scientific machine learning, offering incremental improvements over existing methods.

The paper tackled the problem of symbolic extraction in Kolmogorov-Arnold Networks (KANs), which is sensitive and inefficient, by introducing in-context symbolic regression methods like Greedy in-context Symbolic Regression (GSR) and Gated Matching Pursuit (GMP), resulting in up to 99.8% reduction in median test MSE in robustness experiments.

Symbolic regression aims to replace black-box predictors with concise analytical expressions that can be inspected and validated in scientific machine learning. Kolmogorov-Arnold Networks (KANs) are well suited to this goal because each connection between adjacent units (an "edge") is parametrised by a learnable univariate function that can, in principle, be replaced by a symbolic operator. In practice, however, symbolic extraction is a bottleneck: the standard KAN-to-symbol approach fits operators to each learned edge function in isolation, making the discrete choice sensitive to initialisation and non-convex parameter fitting, and ignoring how local substitutions interact through the full network. We study in-context symbolic regression for operator extraction in KANs, and present two complementary instantiations. Greedy in-context Symbolic Regression (GSR) performs greedy, in-context selection by choosing edge replacements according to end-to-end loss improvement after brief fine-tuning. Gated Matching Pursuit (GMP) amortises this in-context selection by training a differentiable gated operator layer that places an operator library behind sparse gates on each edge; after convergence, gates are discretised (optionally followed by a short in-context greedy refinement pass). We quantify robustness via one-factor-at-a-time (OFAT) hyper-parameter sweeps and assess both predictive error and qualitative consistency of recovered formulas. Across several experiments, greedy in-context symbolic regression achieves up to 99.8% reduction in median OFAT test MSE.

View on arXiv PDF

Similar