LGJul 14, 2021

Conservative Objective Models for Effective Offline Model-Based Optimization

arXiv:2107.06882v1121 citations
Originality Highly original
AI Analysis

This addresses a key challenge in offline optimization for domains like synthetic biology and engineering where active data collection is costly or risky, offering a practical improvement over prior methods.

The paper tackles the problem of distributional shift in data-driven model-based optimization (MBO) by proposing conservative objective models (COMs), which learn a lower-bounding model to prevent overestimation on out-of-distribution inputs, and results show COMs outperform existing methods across domains like protein sequences and materials.

Computational design problems arise in a number of settings, from synthetic biology to computer architectures. In this paper, we aim to solve data-driven model-based optimization (MBO) problems, where the goal is to find a design input that maximizes an unknown objective function provided access to only a static dataset of prior experiments. Such data-driven optimization procedures are the only practical methods in many real-world domains where active data collection is expensive (e.g., when optimizing over proteins) or dangerous (e.g., when optimizing over aircraft designs). Typical methods for MBO that optimize the design against a learned model suffer from distributional shift: it is easy to find a design that "fools" the model into predicting a high value. To overcome this, we propose conservative objective models (COMs), a method that learns a model of the objective function that lower bounds the actual value of the ground-truth objective on out-of-distribution inputs, and uses it for optimization. Structurally, COMs resemble adversarial training methods used to overcome adversarial examples. COMs are simple to implement and outperform a number of existing methods on a wide range of MBO problems, including optimizing protein sequences, robot morphologies, neural network weights, and superconducting materials.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes