Learning to Plan with Logical Automata

Brandon Araki, Kiran Vodrahalli, Cristian Ioan Vasile, and Daniela Rus. Learning to Plan with Logical Automata. In Infer to Control, Workshop on Probabilistic Reinforcement Learning and Structured Control (Infer2Control), NIPS, page Poster, Montreal, Canada, December 2018. link.

Published date:

Saturday, December 8, 2018

Type:

Poster

PDF:

NeurIPS 2018 Infer2Control

BibTex:

NIPS_2018.bib

Abstract

We present a framework for combining imitation learning and logical automata and introduce the Logic-based Value Iteration Network (LVIN) model. By appending a ‘logical’ dimension to the state space of the environment, LVIN can recover and incorporate the transition matrix of a finite state automaton derived from a Linear Temporal Logic formula into the policy learned through imitation. This approach improves upon the original VIN in several ways: LVIN (1) is capable of learning logic corresponding to long sequences of steps, (2) adapts easily to new specifications, and (3) is amenable to correction after faulty expert demonstrations (e.g., in a driving domain).

Tags:

Automata

Value Iteration Networks

Reinforcement Learning

Neural Networks

Machine Learning

Markov Decision Process

Robotics