How the Move Acceptance Hyper-Heuristic Copes With Local Optima: Drastic Differences Between Jumps and Cliffs
This work addresses a theoretical gap in understanding hyper-heuristic performance for combinatorial optimization, providing insights into algorithm design for researchers in evolutionary computation, though it is incremental as it builds on prior analyses.
The paper resolves the open question of the Move Acceptance Hyper-Heuristic (MAHH) performance on jump functions, proving it is much slower than simple elitist evolutionary algorithms (EAs) with a runtime of at least Ω(n^(2m-1)/(2m-1)!) for gap size m = o(n^(1/2)), but shows that combining MAHH with global mutation can achieve runtime O(min{m n^m, n^(2m-1)/(m!Ω(m)^(m-2))}), suggesting hybrid approaches may be effective.
In recent work, Lissovoi, Oliveto, and Warwicker (Artificial Intelligence (2023)) proved that the Move Acceptance Hyper-Heuristic (MAHH) leaves the local optimum of the multimodal cliff benchmark with remarkable efficiency. With its $O(n^3)$ runtime, for almost all cliff widths $d,$ the MAHH massively outperforms the $Θ(n^d)$ runtime of simple elitist evolutionary algorithms (EAs). For the most prominent multimodal benchmark, the jump functions, the given runtime estimates of $O(n^{2m} m^{-Θ(m)})$ and $Ω(2^{Ω(m)})$, for gap size $m \ge 2$, are far apart and the real performance of MAHH is still an open question. In this work, we resolve this question. We prove that for any choice of the MAHH selection parameter~$p$, the expected runtime of the MAHH on a jump function with gap size $m = o(n^{1/2})$ is at least $Ω(n^{2m-1} / (2m-1)!)$. This renders the MAHH much slower than simple elitist evolutionary algorithms with their typical $O(n^m)$ runtime. We also show that the MAHH with the global bit-wise mutation operator instead of the local one-bit operator optimizes jump functions in time $O(\min\{m n^m,\frac{n^{2m-1}}{m!Ω(m)^{m-2}}\})$, essentially the minimum of the optimization times of the $(1+1)$ EA and the MAHH. This suggests that combining several ways to cope with local optima can be a fruitful approach.