pith. machine review for the scientific record.

arxiv: 1806.11525 · v2 · submitted 2018-06-29 · 💻 cs.CL · cs.LG

Recognition: unknown

Counting to Explore and Generalize in Text-based Games

keywords text-based, games, agent, difficulty, generalize, policies, approaches, chain

We propose a recurrent RL agent with an episodic exploration mechanism that helps discover good policies in text-based game environments. We show promising results on a set of generated text-based games of varying difficulty, where the goal is to collect a coin located at the end of a chain of rooms. In contrast to previous text-based RL approaches, we observe that our agent learns policies that generalize to unseen games of greater difficulty.

This paper has not been read by Pith yet.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. ALFWorld: Aligning Text and Embodied Environments for Interactive Learning

    cs.CL 2020-10 conditional novelty 6.0

    ALFWorld aligns text-based and embodied visual environments so agents can learn abstract policies in TextWorld that transfer to better performance on ALFRED tasks than visual-only training.