LG MLJun 1

Local Preferential Bayesian Optimization

Johanna Menn, Miriam Kober, Paul Brunzema, David Stenger, Sebastian Trimpe

arXiv:2606.0235112.1

Predicted impact top 57% in LG · last 90 daysOriginality Incremental advance

AI Analysis

For practitioners tuning expensive experiments with human feedback, this work addresses the inefficiency of global search in high-dimensional problems by introducing local optimization techniques.

Local PBO methods adapt trust-region and derivative-informed local search to pairwise preference feedback, achieving substantial reductions in cumulative regret on high-dimensional and complex landscapes compared to global preference-based baselines.

Bayesian optimization (BO) is a popular and effective approach for tuning expensive, noisy experiments, but requires the formulation of an explicit objective function. Preferential BO (PBO) removes this requirement by learning from pairwise human feedback, yet existing methods struggle to efficiently optimize beyond low- and medium-dimensional problems due to their global search approaches. We address this limitation by developing a family of local PBO methods that transfer key ideas from high-dimensional BO to the preferential setting. In particular, we introduce local PBO methods which adapt trust-region and derivative-informed local search to pairwise preference feedback, where the latter exploits first- and second-order derivatives of the Laplace-approximated GP posterior. Our benchmark on GP sample paths, standard optimization benchmark functions, and policy-search tasks shows that local PBO methods are especially effective in high-dimensional and complex landscapes with steep optima. Compared with global preference-based baselines, they can substantially reduce cumulative regret, making them particularly useful for real-world preference-based optimization tasks such as policy search.

View on arXiv PDF

Similar