Counting to Explore and Generalize in Text-based Games

Xingdi Yuan, Marc-Alexandre Côté, Alessandro Sordoni, Romain Laroche, Remi Tachet des Combes, Matthew Hausknecht, Adam Trischler

classification: cs.CL, cs.LG

keywords: text-based, games, agent, difficulty, generalize, policies, approaches, chain

We propose a recurrent RL agent with an episodic exploration mechanism that helps discover good policies in text-based game environments. We show promising results on a set of generated text-based games of varying difficulty where the goal is to collect a coin located at the end of a chain of rooms. In contrast to previous text-based RL approaches, we observe that our agent learns policies that generalize to unseen games of greater difficulty.
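
To make the idea of an episodic, count-based exploration signal concrete, here is a minimal sketch. It is an illustration under stated assumptions, not the authors' implementation: the class name `EpisodicCountBonus`, the bonus value of 1.0, and the use of raw observation strings as state keys are all hypothetical choices. The sketch gives a bonus the first time an observation is seen within an episode and resets its counts when a new episode begins.

```python
from collections import Counter


class EpisodicCountBonus:
    """Hypothetical helper: bonus for the first visit to an observation per episode."""

    def __init__(self, bonus=1.0):
        self.bonus = bonus          # assumed bonus magnitude, not taken from the paper
        self.counts = Counter()     # per-episode visit counts keyed by observation text

    def reset(self):
        """Forget all counts; call at the start of each episode."""
        self.counts.clear()

    def __call__(self, observation):
        """Record a visit and return the exploration bonus for it."""
        self.counts[observation] += 1
        return self.bonus if self.counts[observation] == 1 else 0.0


# Usage: revisiting "kitchen" within the same episode earns no extra bonus.
explorer = EpisodicCountBonus()
rooms = ["kitchen", "hallway", "kitchen", "attic"]
bonuses = [explorer(room) for room in rooms]

# After a reset (new episode), the same room counts as new again.
explorer.reset()
after_reset = explorer("kitchen")
```

Because the counts are cleared every episode, a signal of this kind rewards covering new rooms in the current game rather than memorizing one particular map, which is one plausible reason such a mechanism could help on unseen games.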

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
cs.CL 2020-10 conditional novelty 6.0

ALFWorld aligns text-based and embodied visual environments so agents can learn abstract policies in TextWorld that transfer to better performance on ALFRED tasks than visual-only training.