Temporal Logic Control of POMDPs via Label-based Stochastic Simulation Relations

Sofie Haesaert, Petter Nilsson, Cristian Ioan Vasile, Rohan Thakker, Ali-akbar Aghamohammadi, Aaron D. Ames, and Richard M. Murray. Temporal Logic Control of POMDPs via Label-based Stochastic Simulation Relations. In IFAC Conference on Analysis and Design of Hybrid Systems (ADHS), pages 271–276, Oxford, UK, July 2018. doi:10.1016/j.ifacol.2018.08.046.

Published date: Friday, July 13, 2018
Abstract

Synthesizing controllers that guarantee linear temporal logic specifications on partially observable Markov decision processes (POMDPs) via their belief models is computationally challenging because of the continuous spaces involved. In this work, we construct a finite-state abstraction on which a control policy is synthesized and then refined back to the original belief model. We introduce a new notion of label-based approximate stochastic simulation to quantify the deviation between belief models. We develop a robust synthesis methodology that yields a lower bound on the satisfaction probability by compensating for deviations a priori, and that uses a novel, less conservative control refinement.
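
As a minimal illustration of why a belief model lives on a continuous space even when the underlying POMDP is finite, the following Python sketch performs one Bayesian belief update under a fixed action. It is not taken from the paper: the function name `belief_update`, the two-state model, and all numbers are hypothetical and chosen only for illustration.

```python
def belief_update(belief, transition, observation_likelihood):
    """One Bayes-filter step for a finite-state POMDP under a fixed action.

    belief[s]                  : prior probability of being in state s
    transition[s][s2]          : P(s2 | s, action)
    observation_likelihood[s2] : P(received observation | s2)
    """
    n = len(belief)
    # Prediction: push the prior belief through the transition kernel.
    predicted = [
        sum(belief[s] * transition[s][s2] for s in range(n))
        for s2 in range(n)
    ]
    # Correction: weight each state by the likelihood of the observation.
    unnormalized = [observation_likelihood[s2] * predicted[s2] for s2 in range(n)]
    total = sum(unnormalized)
    if total == 0.0:
        raise ValueError("observation has zero probability under the predicted belief")
    # Normalize so the posterior is again a probability distribution.
    return [w / total for w in unnormalized]


# Hypothetical two-state model (assumed values, not from the paper).
belief = [0.5, 0.5]
transition = [[0.9, 0.1], [0.2, 0.8]]
likelihood = [0.7, 0.1]
new_belief = belief_update(belief, transition, likelihood)
```

Each update returns a point in the probability simplex, so the set of reachable beliefs is in general a subset of a continuous space. This is the kind of model that the finite-state abstraction described in the abstract is meant to approximate.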