LG NEMay 11, 2024

Sharpness-Aware Minimization for Evolutionary Feature Construction in Regression

Hengzhe Zhang, Qi Chen, Bing Xue, Wolfgang Banzhaf, Mengjie Zhang

arXiv:2405.06869v16.43 citationsh-index: 59

Originality Incremental advance

AI Analysis

This addresses overfitting in symbolic regression for practitioners, offering an incremental improvement with a novel optimization approach.

The paper tackles overfitting in genetic programming-based evolutionary feature construction for regression by introducing sharpness-aware minimization to discover robust symbolic features, achieving superior performance over standard GP and state-of-the-art methods on 58 real-world datasets.

In recent years, genetic programming (GP)-based evolutionary feature construction has achieved significant success. However, a primary challenge with evolutionary feature construction is its tendency to overfit the training data, resulting in poor generalization on unseen data. In this research, we draw inspiration from PAC-Bayesian theory and propose using sharpness-aware minimization in function space to discover symbolic features that exhibit robust performance within a smooth loss landscape in the semantic space. By optimizing sharpness in conjunction with cross-validation loss, as well as designing a sharpness reduction layer, the proposed method effectively mitigates the overfitting problem of GP, especially when dealing with a limited number of instances or in the presence of label noise. Experimental results on 58 real-world regression datasets show that our approach outperforms standard GP as well as six state-of-the-art complexity measurement methods for GP in controlling overfitting. Furthermore, the ensemble version of GP with sharpness-aware minimization demonstrates superior performance compared to nine fine-tuned machine learning and symbolic regression algorithms, including XGBoost and LightGBM.

View on arXiv PDF

Similar