The authors compare multiple methods for incorporating action information into RNN state updates for RL and report empirical results on illustrative domains.
Line is the median over 1000 episodes, with the shaded region as the 1st and 3rd quantile over the same window
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.LG 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Investigating Action Encodings in Recurrent Neural Networks in Reinforcement Learning
The authors compare multiple methods for incorporating action information into RNN state updates for RL and report empirical results on illustrative domains.