LOFLSYSYMar 21, 2017

Permissive Supervisor Synthesis for Markov Decision Processes through Learning

arXiv:1703.0735111 citationsh-index: 37
AI Analysis

For designers of probabilistic systems like power grids and robotics, this provides a scalable method to synthesize permissive supervisors without building the full composed system.

The paper proposes a learning-based framework for permissive supervisor synthesis in Markov Decision Processes, using compositional model checking to avoid state space explosion. The approach is guaranteed to terminate and produce correct supervisors.

This paper considers the permissive supervisor synthesis for probabilistic systems modeled as Markov Decision Processes (MDP). Such systems are prevalent in power grids, transportation networks, communication networks and robotics. Unlike centralized planning and optimization based planning, we propose a novel supervisor synthesis framework based on learning and compositional model checking to generate permissive local supervisors in a distributed manner. With the recent advance in assume-guarantee reasoning verification for probabilistic systems, building the composed system can be avoided to alleviate the state space explosion and our framework learn the supervisors iteratively based on the counterexamples from verification. Our approach is guaranteed to terminate in finite steps and to be correct.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes