AI OCNov 18, 2024

Robust Markov Decision Processes: A Place Where AI and Formal Methods Meet

Marnix Suilen, Thom Badings, Eline M. Bovy, David Parker, Nils Jansen

arXiv:2411.11451v219.626 citationsh-index: 55Principles of Verification

Originality Synthesis-oriented

AI Analysis

This is an incremental tutorial survey that provides a gentle introduction to RMDPs for researchers in AI and formal methods, without presenting new results.

This paper addresses the problem of Markov decision processes (MDPs) requiring precisely known transition probabilities by surveying robust MDPs (RMDPs), which use uncertainty sets to handle this limitation, and discusses their fundamentals, solution methods, and applications in areas like reinforcement learning.

Markov decision processes (MDPs) are a standard model for sequential decision-making problems and are widely used across many scientific areas, including formal methods and artificial intelligence (AI). MDPs do, however, come with the restrictive assumption that the transition probabilities need to be precisely known. Robust MDPs (RMDPs) overcome this assumption by instead defining the transition probabilities to belong to some uncertainty set. We present a gentle survey on RMDPs, providing a tutorial covering their fundamentals. In particular, we discuss RMDP semantics and how to solve them by extending standard MDP methods such as value iteration and policy iteration. We also discuss how RMDPs relate to other models and how they are used in several contexts, including reinforcement learning and abstraction techniques. We conclude with some challenges for future work on RMDPs.

View on arXiv PDF

Similar