AINCSep 2, 2022

Object-based active inference

arXiv:2209.01258v18 citationsh-index: 22
Originality Incremental advance
AI Analysis

This work addresses the challenge of object-based perception and action for AI agents, representing an incremental improvement by combining existing frameworks.

The paper tackled the problem of enabling agents to interact intelligently with objects in the world by integrating object-based representations into active inference, resulting in OBAI learning to segment and manipulate objects from video input in a simple environment.

The world consists of objects: distinct entities possessing independent properties and dynamics. For agents to interact with the world intelligently, they must translate sensory inputs into the bound-together features that describe each object. These object-based representations form a natural basis for planning behavior. Active inference (AIF) is an influential unifying account of perception and action, but existing AIF models have not leveraged this important inductive bias. To remedy this, we introduce 'object-based active inference' (OBAI), marrying AIF with recent deep object-based neural networks. OBAI represents distinct objects with separate variational beliefs, and uses selective attention to route inputs to their corresponding object slots. Object representations are endowed with independent action-based dynamics. The dynamics and generative model are learned from experience with a simple environment (active multi-dSprites). We show that OBAI learns to correctly segment the action-perturbed objects from video input, and to manipulate these objects towards arbitrary goals.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes