NEJun 19, 2020

Hybridizing the 1/5-th Success Rule with Q-Learning for Controlling the Mutation Rate of an Evolutionary Algorithm

Arina Buzdalova, Carola Doerr, Anna Rodionova

arXiv:2006.11026v17.53 citations

Originality Incremental advance

AI Analysis

This work addresses parameter control in evolutionary algorithms, which is crucial for optimizing performance, but it is incremental as it builds on existing methods like the one-fifth success rule and Q-learning.

The paper tackles the problem of tuning mutation rates in evolutionary algorithms by introducing a hybrid parameter control technique that combines the one-fifth success rule with Q-learning, demonstrating that it achieves equal or superior performance across all offspring population sizes and extends to multiple benchmark problems.

It is well known that evolutionary algorithms (EAs) achieve peak performance only when their parameters are suitably tuned to the given problem. Even more, it is known that the best parameter values can change during the optimization process. Parameter control mechanisms are techniques developed to identify and to track these values. Recently, a series of rigorous theoretical works confirmed the superiority of several parameter control techniques over EAs with best possible static parameters. Among these results are examples for controlling the mutation rate of the $(1+λ)$~EA when optimizing the OneMax problem. However, it was shown in [Rodionova et al., GECCO'19] that the quality of these techniques strongly depends on the offspring population size $λ$. We introduce in this work a new hybrid parameter control technique, which combines the well-known one-fifth success rule with Q-learning. We demonstrate that our HQL mechanism achieves equal or superior performance to all techniques tested in [Rodionova et al., GECCO'19] and this -- in contrast to previous parameter control methods -- simultaneously for all offspring population sizes $λ$. We also show that the promising performance of HQL is not restricted to OneMax, but extends to several other benchmark problems.

View on arXiv PDF

Similar