AILGROSYDec 17, 2021

Compositional Learning-based Planning for Vision POMDPs

arXiv:2112.09456v27 citations
Originality Incremental advance
AI Analysis

This addresses the challenge of efficient decision-making in real-world robotic or autonomous systems with visual inputs, representing an incremental improvement over existing vision POMDP methods.

The paper tackles the problem of planning in vision-based partially observable Markov decision processes (POMDPs) with high-dimensional image observations, proposing Visual Tree Search (VTS) which combines offline generative models with online planning to achieve robust performance and faster training, significantly outperforming baseline algorithms.

The Partially Observable Markov Decision Process (POMDP) is a powerful framework for capturing decision-making problems that involve state and transition uncertainty. However, most current POMDP planners cannot effectively handle high-dimensional image observations prevalent in real world applications, and often require lengthy online training that requires interaction with the environment. In this work, we propose Visual Tree Search (VTS), a compositional learning and planning procedure that combines generative models learned offline with online model-based POMDP planning. The deep generative observation models evaluate the likelihood of and predict future image observations in a Monte Carlo tree search planner. We show that VTS is robust to different types of image noises that were not present during training and can adapt to different reward structures without the need to re-train. This new approach significantly and stably outperforms several baseline state-of-the-art vision POMDP algorithms while using a fraction of the training time.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes