Recognition: unknown
Counting to Explore and Generalize in Text-based Games
classification
💻 cs.CL
cs.LG
keywords
text-basedgamesagentdifficultygeneralizepoliciesapproacheschain
read the original abstract
We propose a recurrent RL agent with an episodic exploration mechanism that helps discovering good policies in text-based game environments. We show promising results on a set of generated text-based games of varying difficulty where the goal is to collect a coin located at the end of a chain of rooms. In contrast to previous text-based RL approaches, we observe that our agent learns policies that generalize to unseen games of greater difficulty.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
ALFWorld aligns text-based and embodied visual environments so agents can learn abstract policies in TextWorld that transfer to better performance on ALFRED tasks than visual-only training.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.