Online Abstraction with MDP Homomorphisms for Deep Learning

Ondrej Biza; Robert Platt

arxiv: 1811.12929 · v2 · pith:HOEOHHY2new · submitted 2018-11-30 · 💻 cs.LG · stat.ML

keywords: abstraction · abstractions · algorithm · deep · homomorphisms · learning · mdps · aamas

Abstraction of Markov Decision Processes is a useful tool for solving complex problems, as it can ignore unimportant aspects of an environment, simplifying the process of learning an optimal policy. In this paper, we propose a new algorithm for finding abstract MDPs in environments with continuous state spaces. It is based on MDP homomorphisms, a structure-preserving mapping between MDPs. We demonstrate our algorithm's ability to learn abstractions from collected experience and show how to reuse the abstractions to guide exploration in new tasks the agent encounters. Our novel task transfer method outperforms baselines based on a deep Q-network in the majority of our experiments. The source code is at https://github.com/ondrejba/aamas_19.
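To make "structure-preserving mapping between MDPs" concrete, here is a minimal sketch for a small tabular MDP. It is not the paper's algorithm, which targets continuous state spaces; it only checks the two standard homomorphism conditions (matching rewards, and matching transition probabilities once next states are grouped into abstract blocks). The function name `is_mdp_homomorphism`, the toy MDP, and the simplification that actions map to themselves are all assumptions made for illustration.

```python
import numpy as np

def is_mdp_homomorphism(T, R, T_abs, R_abs, f, atol=1e-8):
    """Check whether state map f is an MDP homomorphism (identity action map assumed).

    T:     (S, A, S) transition probabilities of the original MDP
    R:     (S, A)    expected rewards of the original MDP
    T_abs: (Z, A, Z) transition probabilities of the abstract MDP
    R_abs: (Z, A)    expected rewards of the abstract MDP
    f:     (S,)      integer array mapping each state to an abstract state
    """
    n_abs = T_abs.shape[0]
    # One-hot membership matrix: row s marks the abstract block of state s.
    membership = np.eye(n_abs)[f]            # (S, Z)
    # Sum next-state probabilities within each abstract block.
    T_blocks = T @ membership                # (S, A, Z)
    transitions_ok = np.allclose(T_blocks, T_abs[f], atol=atol)
    rewards_ok = np.allclose(R, R_abs[f], atol=atol)
    return transitions_ok and rewards_ok

# Hypothetical toy MDP: states 0 and 1 behave identically, state 2 is absorbing.
T = np.zeros((3, 2, 3))
T[0, 0, 1] = T[1, 0, 0] = T[2, 0, 2] = 1.0   # action 0: swap 0 <-> 1, stay in 2
T[:, 1, 2] = 1.0                             # action 1: always move to state 2
R = np.array([[0.0, 1.0], [0.0, 1.0], [0.0, 0.0]])

# Abstract MDP with two states: block 0 = {0, 1}, block 1 = {2}.
T_abs = np.zeros((2, 2, 2))
T_abs[0, 0, 0] = T_abs[1, 0, 1] = 1.0
T_abs[:, 1, 1] = 1.0
R_abs = np.array([[0.0, 1.0], [0.0, 0.0]])

f_good = np.array([0, 0, 1])   # merges the two equivalent states
f_bad = np.array([0, 1, 1])    # wrongly merges state 1 with the absorbing state

ok_good = is_mdp_homomorphism(T, R, T_abs, R_abs, f_good)
ok_bad = is_mdp_homomorphism(T, R, T_abs, R_abs, f_bad)
```

In this toy case the first mapping satisfies both conditions and the second does not, because state 1 still earns reward and leaves its block under action 1 while the absorbing state does neither.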

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Task-Induced Representational Invariances Depend on Learning Objective in Deep RL
cs.LG · 2026-06 · unverdicted · novelty 7.0

In navigation tasks, DQN learns MDP-homomorphism-invariant representations while PPO learns action-symmetric ones despite comparable performance, with effects on transfer and in LLMs.