LGAIOct 22, 2025

Optimizing the Unknown: Black Box Bayesian Optimization with Energy-Based Model and Reinforcement Learning

arXiv:2510.19530v11 citationsh-index: 2
Originality Incremental advance
AI Analysis

This work addresses a bottleneck in optimizing expensive functions for scientific and engineering applications, offering an incremental improvement over existing BO methods.

The paper tackles the problem of one-step bias in Bayesian Optimization (BO) for costly black-box functions by proposing REBMBO, which integrates Gaussian Processes with an Energy-Based Model and uses reinforcement learning for adaptive multi-step lookahead, achieving superior performance on synthetic and real-world benchmarks.

Existing Bayesian Optimization (BO) methods typically balance exploration and exploitation to optimize costly objective functions. However, these methods often suffer from a significant one-step bias, which may lead to convergence towards local optima and poor performance in complex or high-dimensional tasks. Recently, Black-Box Optimization (BBO) has achieved success across various scientific and engineering domains, particularly when function evaluations are costly and gradients are unavailable. Motivated by this, we propose the Reinforced Energy-Based Model for Bayesian Optimization (REBMBO), which integrates Gaussian Processes (GP) for local guidance with an Energy-Based Model (EBM) to capture global structural information. Notably, we define each Bayesian Optimization iteration as a Markov Decision Process (MDP) and use Proximal Policy Optimization (PPO) for adaptive multi-step lookahead, dynamically adjusting the depth and direction of exploration to effectively overcome the limitations of traditional BO methods. We conduct extensive experiments on synthetic and real-world benchmarks, confirming the superior performance of REBMBO. Additional analyses across various GP configurations further highlight its adaptability and robustness.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes