LGMAMar 10, 2023

Gaussian Max-Value Entropy Search for Multi-Agent Bayesian Optimization

arXiv:2303.05694v115 citationsh-index: 32
Originality Incremental advance
AI Analysis

This work addresses computational bottlenecks in multi-agent Bayesian optimization for applications like robotics, though it is incremental as it builds on existing entropy search methods.

The authors tackled the computational inefficiency of multi-agent Bayesian optimization by proposing Gaussian Max-Value Entropy Search, which approximates the function maximum with a normal distribution to enable closed-form optimization, resulting in improved performance over baselines in numerical tests and stable source-seeking with limited noisy observations on real robots.

We study the multi-agent Bayesian optimization (BO) problem, where multiple agents maximize a black-box function via iterative queries. We focus on Entropy Search (ES), a sample-efficient BO algorithm that selects queries to maximize the mutual information about the maximum of the black-box function. One of the main challenges of ES is that calculating the mutual information requires computationally-costly approximation techniques. For multi-agent BO problems, the computational cost of ES is exponential in the number of agents. To address this challenge, we propose the Gaussian Max-value Entropy Search, a multi-agent BO algorithm with favorable sample and computational efficiency. The key to our idea is to use a normal distribution to approximate the function maximum and calculate its mutual information accordingly. The resulting approximation allows queries to be cast as the solution of a closed-form optimization problem which, in turn, can be solved via a modified gradient ascent algorithm and scaled to a large number of agents. We demonstrate the effectiveness of Gaussian max-value Entropy Search through numerical experiments on standard test functions and real-robot experiments on the source-seeking problem. Results show that the proposed algorithm outperforms the multi-agent BO baselines in the numerical experiments and can stably seek the source with a limited number of noisy observations on real robots.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes