The Effects of Population Size on the Performance of BEAGLE GPU-Based Genetic Programming Runs
For researchers using GPU-based genetic programming, this work provides insights into tuning population size for symbolic regression, though the findings are incremental.
This paper investigates how GPU-enabled population sizes affect the success of BEAGLE-based genetic programming for symbolic regression, finding that some problems benefit from narrow searches (1000 individuals) while others benefit from broad searches (10 million individuals), and introduces stepped population sizes to balance breadth and depth.
The Beagle framework, through GPU-based Genetic Programming, enables population dynamics previously unattainable (within practical time frames) by CPU-constrained Genetic Programming systems. This work explores how GPU-enabled population sizes impact the success of training for symbolic regression problems. Specifically, when using constant population sizes, we see benefits of using very narrow and deep searches (as narrow as 1000 individuals) for some problems, while other problems benefit from very broad and shallow searches (as broad as 10 million individuals). We also explore stepped population sizes that start with large populations and drop to small populations to balance the breadth and depth of search.