AIFLDec 20, 2021

Demonstration Informed Specification Search

arXiv:2112.10807v43 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of inferring complex temporal tasks for AI systems, though it is incremental as it builds on existing methods for specification learning.

The paper tackles the problem of learning temporal task specifications, such as automata and temporal logic, from expert demonstrations, and shows that their proposed DISS algorithm can efficiently identify tasks using only one or two demonstrations.

This paper considers the problem of learning temporal task specifications, e.g. automata and temporal logic, from expert demonstrations. Task specifications are a class of sparse memory augmented rewards with explicit support for temporal and Boolean composition. Three features make learning temporal task specifications difficult: (1) the (countably) infinite number of tasks under consideration; (2) an a-priori ignorance of what memory is needed to encode the task; and (3) the discrete solution space - typically addressed by (brute force) enumeration. To overcome these hurdles, we propose Demonstration Informed Specification Search (DISS): a family of algorithms requiring only black box access to a maximum entropy planner and a task sampler from labeled examples. DISS then works by alternating between conjecturing labeled examples to make the provided demonstrations less surprising and sampling tasks consistent with the conjectured labeled examples. We provide a concrete implementation of DISS in the context of tasks described by Deterministic Finite Automata, and show that DISS is able to efficiently identify tasks from only one or two expert demonstrations.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes