LG MLApr 1, 2020

Statistically Model Checking PCTL Specifications on Markov Decision Processes via Reinforcement Learning

Yu Wang, Nima Roohi, Matthew West, Mahesh Viswanathan, Geir E. Dullerud

arXiv:2004.00273v29.017 citations

Originality Incremental advance

AI Analysis

This work addresses formal verification challenges for control systems, but it is incremental as it builds on existing statistical model checking and reinforcement learning techniques.

The paper tackles the problem of statistically model checking PCTL specifications on Markov Decision Processes by using reinforcement learning to search for feasible policies, and develops a method with provable error guarantees, evaluating it on case studies.

Probabilistic Computation Tree Logic (PCTL) is frequently used to formally specify control objectives such as probabilistic reachability and safety. In this work, we focus on model checking PCTL specifications statistically on Markov Decision Processes (MDPs) by sampling, e.g., checking whether there exists a feasible policy such that the probability of reaching certain goal states is greater than a threshold. We use reinforcement learning to search for such a feasible policy for PCTL specifications, and then develop a statistical model checking (SMC) method with provable guarantees on its error. Specifically, we first use upper-confidence-bound (UCB) based Q-learning to design an SMC algorithm for bounded-time PCTL specifications, and then extend this algorithm to unbounded-time specifications by identifying a proper truncation time by checking the PCTL specification and its negation at the same time. Finally, we evaluate the proposed method on case studies.

View on arXiv PDF

Similar