LGAIMay 18

Automated Kernel Discovery Towards Understanding High-dimensional Bayesian Optimization

arXiv:2605.2024918.7
Predicted impact top 18% in LG · last 90 daysOriginality Highly original
AI Analysis

This work addresses the bottleneck of manual kernel engineering in high-dimensional Bayesian optimization, offering an automated approach that outperforms existing methods.

Kernel Discovery introduces an LLM-driven evolutionary framework for high-dimensional Bayesian optimization that searches a broader kernel space beyond predefined composition rules and does not require conditioning on observations, achieving an average rank of 1.2 out of 17 on five benchmarks.

Gaussian Process (GP) kernels are central to Bayesian optimization (BO), yet designing effective kernels for high-dimensional problems still relies on extensive manual engineering. Existing automated approaches struggle in high dimensions for two bottlenecks: their kernel search space is limited to additions and multiplications of base kernels, and LLM-based approaches require conditioning on raw observations, which becomes infeasible due to context-length limits and the difficulty of extracting meaningful patterns. We introduce \textbf{Kernel Discovery}, a LLM-driven evolutionary framework for high-dimensional BO that searches a broader kernel space beyond predefined composition rules and does not require conditioning on observations. Motivated by the observation that directly prompting an LLM to generate kernel code yields syntactically varied but functionally identical kernels, we adopt a two-stage approach: an LLM first proposes novel mathematical forms, then a second LLM call converts each form into validated, executable code. We also propose a leave-one-out continuous ranked probability score (LOO-CRPS) as a selection criterion that penalizes overfitted kernels. On five high-dimensional BO benchmarks, our method achieves an average rank of \textbf{1.2 out of 17}, outperforming competitive baselines. We further analyze the discovered kernels to identify which kernels lead to improvements in high-dimensional BO.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes