AINov 18, 2017

Learning to select computations

arXiv:1711.06892v338 citations
Originality Highly original
AI Analysis

This addresses the challenge of computational efficiency in AI systems, offering a domain-general solution that is incremental over existing heuristics.

The paper tackled the problem of efficiently selecting computations under limited resources by proposing Bayesian metalevel policy search (BMPS), which achieved near-optimal performance across three metareasoning domains and demonstrated practical utility in an emergency management scenario.

The efficient use of limited computational resources is an essential ingredient of intelligence. Selecting computations optimally according to rational metareasoning would achieve this, but this is computationally intractable. Inspired by psychology and neuroscience, we propose the first concrete and domain-general learning algorithm for approximating the optimal selection of computations: Bayesian metalevel policy search (BMPS). We derive this general, sample-efficient search algorithm for a computation-selecting metalevel policy based on the insight that the value of information lies between the myopic value of information and the value of perfect information. We evaluate BMPS on three increasingly difficult metareasoning problems: when to terminate computation, how to allocate computation between competing options, and planning. Across all three domains, BMPS achieved near-optimal performance and compared favorably to previously proposed metareasoning heuristics. Finally, we demonstrate the practical utility of BMPS in an emergency management scenario, even accounting for the overhead of metareasoning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes