Multiple-Environment Markov Decision Processes

Jean-Fran\c{c}ois Raskin , Ocan Sankur

Authors on Pith no claims yet

classification 💻 cs.LO cs.SY

keywords mdpsmemdpsdecisionevenmarkovobservablepartiallyprocesses

read the original abstract

We introduce Multi-Environment Markov Decision Processes (MEMDPs) which are MDPs with a set of probabilistic transition functions. The goal in a MEMDP is to synthesize a single controller with guaranteed performances against all environments even though the environment is unknown a priori. While MEMDPs can be seen as a special class of partially observable MDPs, we show that several verification problems that are undecidable for partially observable MDPs, are decidable for MEMDPs and sometimes have even efficient solutions.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Probing the Impact of Scale on Data-Efficient, Generalist Transformer World Models for Atari
cs.LG 2026-05 unverdicted novelty 5.0

Transformer world models on Atari exhibit game-specific scaling regimes, but joint training on 26 environments produces consistent monotonic gains that improve downstream control policies to a median normalized score ...