LGAIOct 30, 2023

ExPT: Synthetic Pretraining for Few-Shot Experimental Design

arXiv:2310.19961v126 citationsh-index: 9Has Code
Originality Incremental advance
AI Analysis

This addresses sample efficiency challenges in science and engineering fields where real-world evaluations are costly, though it appears incremental as it builds on existing transformer and pretraining approaches.

The paper tackles the problem of few-shot experimental design where only limited labeled data is available, by introducing ExPT, a foundation model that uses synthetic pretraining and in-context learning to generate optimal input designs, demonstrating superior generality and performance compared to existing methods.

Experimental design is a fundamental problem in many science and engineering fields. In this problem, sample efficiency is crucial due to the time, money, and safety costs of real-world design evaluations. Existing approaches either rely on active data collection or access to large, labeled datasets of past experiments, making them impractical in many real-world scenarios. In this work, we address the more challenging yet realistic setting of few-shot experimental design, where only a few labeled data points of input designs and their corresponding values are available. We approach this problem as a conditional generation task, where a model conditions on a few labeled examples and the desired output to generate an optimal input design. To this end, we introduce Experiment Pretrained Transformers (ExPT), a foundation model for few-shot experimental design that employs a novel combination of synthetic pretraining with in-context learning. In ExPT, we only assume knowledge of a finite collection of unlabelled data points from the input domain and pretrain a transformer neural network to optimize diverse synthetic functions defined over this domain. Unsupervised pretraining allows ExPT to adapt to any design task at test time in an in-context fashion by conditioning on a few labeled data points from the target task and generating the candidate optima. We evaluate ExPT on few-shot experimental design in challenging domains and demonstrate its superior generality and performance compared to existing methods. The source code is available at https://github.com/tung-nd/ExPT.git.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes