NE LGMar 23, 2025

Reinforcement Learning-based Self-adaptive Differential Evolution through Automated Landscape Feature Learning

Hongshu Guo, Sijie Ma, Zechuan Huang, Yuzhi Hu, Zeyuan Ma, Xinglin Zhang, Yue-Jiao Gong

arXiv:2503.18061v119 citationsh-index: 9Has CodeGECCO

Originality Incremental advance

AI Analysis

This is an incremental improvement for researchers in black-box optimization, automating feature extraction to reduce human expertise requirements.

The paper tackles the problem of Meta-Black-Box-Optimization methods relying on human-crafted feature extraction by introducing RLDE-AFL, a method that automates feature learning during meta-learning, resulting in superior optimization performance compared to advanced DE methods and MetaBBO baselines on synthetic and realistic benchmarks.

Recently, Meta-Black-Box-Optimization (MetaBBO) methods significantly enhance the performance of traditional black-box optimizers through meta-learning flexible and generalizable meta-level policies that excel in dynamic algorithm configuration (DAC) tasks within the low-level optimization, reducing the expertise required to adapt optimizers for novel optimization tasks. Though promising, existing MetaBBO methods heavily rely on human-crafted feature extraction approach to secure learning effectiveness. To address this issue, this paper introduces a novel MetaBBO method that supports automated feature learning during the meta-learning process, termed as RLDE-AFL, which integrates a learnable feature extraction module into a reinforcement learning-based DE method to learn both the feature encoding and meta-level policy. Specifically, we design an attention-based neural network with mantissa-exponent based embedding to transform the solution populations and corresponding objective values during the low-level optimization into expressive landscape features. We further incorporate a comprehensive algorithm configuration space including diverse DE operators into a reinforcement learning-aided DAC paradigm to unleash the behavior diversity and performance of the proposed RLDE-AFL. Extensive benchmark results show that co-training the proposed feature learning module and DAC policy contributes to the superior optimization performance of RLDE-AFL to several advanced DE methods and recent MetaBBO baselines over both synthetic and realistic BBO scenarios. The source codes of RLDE-AFL are available at https://github.com/GMC-DRL/RLDE-AFL.

View on arXiv PDF Code

Similar