NE AIJan 19

An Evolutionary Framework for Automatic Optimization Benchmark Generation via Large Language Models

Yuhiro Ono, Tomohiro Harada, Yukiya Miura

arXiv:2601.12723v12.5

Originality Incremental advance

AI Analysis

This addresses the need for more realistic and varied optimization benchmarks for algorithm developers, though it appears incremental as it builds on existing evolutionary and LLM methods.

The authors tackled the problem of creating diverse optimization benchmarks by proposing an evolutionary framework that uses large language models as generative operators to automatically generate benchmark problems. The framework successfully produced problems where a target algorithm outperformed a comparative one in over 80% of trials, with analysis showing distinct geometric characteristics.

Optimization benchmarks play a fundamental role in assessing algorithm performance; however, existing artificial benchmarks often fail to capture the diversity and irregularity of real-world problem structures, while benchmarks derived from real-world problems are costly and difficult to construct. To address these challenges, we propose an evolutionary automatic benchmark generation framework that leverages a large language model (LLM) as a generative operator, termed the LLM-driven evolutionary benchmark generator (LLM-EBG). In this framework, the LLM serves as an evolutionary operator that generates and evolves benchmark problems within a flexible, expressive representation space. As a case study, we generate unconstrained single-objective continuous minimization problems represented as mathematical expressions designed to induce significant performance differences between a genetic algorithm (GA) and differential evolution (DE). Experimental results show that LLM-EBG successfully produces benchmark problems in which the designated target algorithm consistently outperforms the comparative algorithm in more than 80\% of trials. Furthermore, exploratory landscape analysis reveals that benchmarks favoring GA are highly sensitive to variable scaling, demonstrating that the proposed framework can generate problems with distinct geometric characteristics that reflect the intrinsic search behaviors of different optimization algorithms.

View on arXiv PDF

Similar