Deep Reinforcement Learning With Macro-Actions

Clemens Rosenbaum; Ishan P. Durugkar; Sridhar Mahadevan; Stefan Dernbach

arxiv: 1606.04615 · v1 · pith:55SI3NYRnew · submitted 2016-06-15 · 💻 cs.LG · cs.AI· cs.NE

Deep Reinforcement Learning With Macro-Actions

Ishan P. Durugkar , Clemens Rosenbaum , Stefan Dernbach , Sridhar Mahadevan This is my paper

classification 💻 cs.LG cs.AIcs.NE

keywords learningdeepreinforcementapproachesataricomplexconvergencemacro-actions

0 comments

read the original abstract

Deep reinforcement learning has been shown to be a powerful framework for learning policies from complex high-dimensional sensory inputs to actions in complex tasks, such as the Atari domain. In this paper, we explore output representation modeling in the form of temporal abstraction to improve convergence and reliability of deep reinforcement learning approaches. We concentrate on macro-actions, and evaluate these on different Atari 2600 games, where we show that they yield significant improvements in learning speed. Additionally, we show that they can even achieve better scores than DQN. We offer analysis and explanation for both convergence and final results, revealing a problem deep RL approaches have with sparse reward signals.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Reinforcement Learning with Discrete Diffusion Policies for Combinatorial Action Spaces
cs.LG 2025-09 unverdicted novelty 6.0

A method trains discrete diffusion policies for combinatorial RL by matching to a PMD-regularized target distribution, reporting SOTA performance and sample efficiency on DNA generation, macro-action, and multi-agent ...
Enhancing Human-Likeness in Reinforcement Learning Agents via Hierarchical Macro Action Quantization
cs.RO 2026-05 unverdicted novelty 3.0

HiMAQ applies hierarchical vector quantization to human demonstrations to generate macro actions that yield higher human-likeness scores than flat MAQ on D4RL while matching or exceeding success rates across IQL, SAC,...