Control Synthesis from Linear Temporal Logic Specifications using Model-Free Reinforcement Learning

2019 · cs.RO · arXiv 1909.07299

1 Pith paper citing it

abstract

We present a reinforcement learning (RL) framework to synthesize a control policy from a given linear temporal logic (LTL) specification in an unknown stochastic environment that can be modeled as a Markov Decision Process (MDP). Specifically, we learn a policy that maximizes the probability of satisfying the LTL formula without learning the transition probabilities. We introduce a novel rewarding and path-dependent discounting mechanism based on the LTL formula such that (i) an optimal policy maximizing the total discounted reward effectively maximizes the probability of satisfying the LTL objective, and (ii) a model-free RL algorithm using these rewards and discount factors is guaranteed to converge to such a policy. Finally, we illustrate the applicability of our RL-based synthesis approach on two motion planning case studies.
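
The abstract does not spell out the reward and discount definitions, so the following is only a minimal sketch of what a path-dependent discounting scheme could look like. It assumes that a path is summarized by whether each visited state is accepting for the LTL automaton, that accepting states pay a reward of `1 - gamma_b` and discount by `gamma_b`, and that all other states pay nothing and discount by `gamma`. The function name, the parameter names, and the default values are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def discounted_return(
    accepting: Sequence[bool],
    gamma: float = 0.99,    # assumed discount in non-accepting states
    gamma_b: float = 0.9,   # assumed discount in accepting states
) -> float:
    """Return of a finite path prefix under state-dependent rewards and discounts.

    accepting[t] is True when the state visited at step t is accepting.
    """
    total = 0.0
    weight = 1.0  # product of the discount factors of all earlier states
    for is_accepting in accepting:
        # Reward and discount both depend on the current state (assumption).
        reward = (1.0 - gamma_b) if is_accepting else 0.0
        discount = gamma_b if is_accepting else gamma
        total += weight * reward
        weight *= discount
    return total
```

Under these assumptions, a prefix that stays in accepting states accumulates a return approaching 1 as it grows, while a prefix that never reaches an accepting state has return 0, which is the sense in which the discounted reward can track satisfaction of the specification.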

representative citing papers

Model-Free Reinforcement Learning for Stochastic Games with Linear Temporal Logic Objectives

cs.RO · 2020-10-02 · unverdicted · novelty 6.0

A model-free RL methodology is developed to maximize the probability of LTL satisfaction in unknown stochastic games when the derived DRA has a single Rabin pair, with a generalization providing lower bounds for multiple pairs.

citing papers explorer

Showing 1 of 1 citing paper.

Model-Free Reinforcement Learning for Stochastic Games with Linear Temporal Logic Objectives

cs.RO · 2020-10-02 · unverdicted · none · ref 16 · internal anchor
