pith. machine review for the scientific record. sign in

arxiv: 1405.4733 · v2 · submitted 2014-05-19 · 💻 cs.LO · cs.SY

Recognition: unknown

Multiple-Environment Markov Decision Processes

Authors on Pith no claims yet
classification 💻 cs.LO cs.SY
keywords mdpsmemdpsdecisionevenmarkovobservablepartiallyprocesses
0
0 comments X
read the original abstract

We introduce Multi-Environment Markov Decision Processes (MEMDPs) which are MDPs with a set of probabilistic transition functions. The goal in a MEMDP is to synthesize a single controller with guaranteed performances against all environments even though the environment is unknown a priori. While MEMDPs can be seen as a special class of partially observable MDPs, we show that several verification problems that are undecidable for partially observable MDPs, are decidable for MEMDPs and sometimes have even efficient solutions.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Probing the Impact of Scale on Data-Efficient, Generalist Transformer World Models for Atari

    cs.LG 2026-05 unverdicted novelty 5.0

    Transformer world models on Atari exhibit game-specific scaling regimes, but joint training on 26 environments produces consistent monotonic gains that improve downstream control policies to a median normalized score ...