Investigating Recurrence and Eligibility Traces in Deep Q-Networks

Doina Precup; Jean Harb

arxiv: 1704.05495 · v1 · pith:4L2YG6XQnew · submitted 2017-04-18 · 💻 cs.AI · cs.LG

Investigating Recurrence and Eligibility Traces in Deep Q-Networks

Jean Harb , Doina Precup This is my paper

classification 💻 cs.AI cs.LG

keywords eligibilitytracesatarirecurrenttrainingusedbackbenefits

0 comments

read the original abstract

Eligibility traces in reinforcement learning are used as a bias-variance trade-off and can often speed up training time by propagating knowledge back over time-steps in a single update. We investigate the use of eligibility traces in combination with recurrent networks in the Atari domain. We illustrate the benefits of both recurrent nets and eligibility traces in some Atari games, and highlight also the importance of the optimization used in the training.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Trace-Mediated Peak Bias: Bridging Temporal Credit Assignment and Cognitive Heuristics in Deep Reinforcement Learning
cs.LG 2026-06 unverdicted novelty 5.0

Eligibility traces in deep RL create a peak bias by amplifying distal TD errors into gradient shocks that fixed-step SGD cannot normalize, leading to overestimation of peak-reward trajectories and a mechanistic accoun...