HC AIOct 23, 2023

Interactive AI Alignment: Specification, Process, and Evaluation Alignment

Michael Terry, Chinmay Kulkarni, Martin Wattenberg, Lucas Dixon, Meredith Ringel Morris

DeepMind

arXiv:2311.00710v221.933 citationsh-index: 19

Originality Synthesis-oriented

AI Analysis

This work addresses the challenge of ensuring AI systems correctly interpret and execute user goals in high-level interactions, though it is incremental as it builds on existing concepts.

The paper tackles the problem of aligning AI systems with user intentions in declarative interactions by defining three alignment objectives: specification, process, and evaluation alignment, and demonstrates their application using existing systems.

Modern AI enables a high-level, declarative form of interaction: Users describe the intended outcome they wish an AI to produce, but do not actually create the outcome themselves. In contrast, in traditional user interfaces, users invoke specific operations to create the desired outcome. This paper revisits the basic input-output interaction cycle in light of this declarative style of interaction, and connects concepts in AI alignment to define three objectives for interactive alignment of AI: specification alignment (aligning on what to do), process alignment (aligning on how to do it), and evaluation alignment (assisting users in verifying and understanding what was produced). Using existing systems as examples, we show how these user-centered views of AI alignment can be used descriptively, prescriptively, and as an evaluative aid.

View on arXiv PDF

Similar