Self-Attentional Credit Assignment for Transfer in Reinforcement Learning

Johan Ferret; Matthieu Geist; Olivier Pietquin; Raphaël Marinier

arXiv: 1907.08027 · v2 · pith:MQIUGB2Znew · submitted 2019-07-18 · cs.LG · cs.AI



keywords: transfer, credit, learning, ability, assign, assignment, function, novel


The ability to transfer knowledge to novel environments and tasks is a sensible desideratum for general learning agents. Despite its apparent promise, transfer in RL is still an open and little-exploited research area. In this paper, we take a brand-new perspective on transfer: we suggest that the ability to assign credit unveils structural invariants in the tasks that can be transferred to make RL more sample-efficient. Our main contribution is SECRET, a novel approach to transfer learning for RL that uses a backward-view credit assignment mechanism based on a self-attentive architecture. Two aspects are key to its generality: it learns to assign credit as a separate offline supervised process, and it exclusively modifies the reward function. Consequently, it can be supplemented by transfer methods that do not modify the reward function, and it can be plugged on top of any RL algorithm.
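The abstract does not spell out how credit is turned into a modified reward, so the following is only a minimal sketch of the general idea, not the paper's formula. It assumes a hypothetical matrix `attention[t][s]` (for example, weights read out of a trained self-attentive model) that says how much step `s` contributed to the reward seen at step `t`, and it redistributes each reward backward over the steps that preceded it. The function name and the normalization are illustrative assumptions.

```python
from typing import List, Sequence


def redistribute_rewards(
    rewards: Sequence[float],
    attention: Sequence[Sequence[float]],
) -> List[float]:
    """Spread each observed reward over the current and earlier timesteps.

    attention[t][s] is a HYPOTHETICAL weight for how much step s (s <= t)
    contributed to the reward observed at step t. Entries with s > t are
    ignored, which is what makes this a backward view.
    """
    horizon = len(rewards)
    credit = [0.0] * horizon
    for t, reward in enumerate(rewards):
        if reward == 0.0:
            continue  # nothing to redistribute at this step
        # Keep only non-negative weights on the current and past steps.
        weights = [max(attention[t][s], 0.0) for s in range(t + 1)]
        total = sum(weights)
        if total == 0.0:
            # No usable attention: leave the reward where it was observed.
            credit[t] += reward
            continue
        for s, weight in enumerate(weights):
            credit[s] += reward * weight / total
    return credit


# Toy episode: a single sparse reward at the last step.
rewards = [0.0, 0.0, 1.0]
attention = [
    [1.0, 0.0, 0.0],
    [0.5, 0.5, 0.0],
    [0.5, 0.25, 0.25],  # the final reward mostly credits step 0
]
credit = redistribute_rewards(rewards, attention)
```

Because only the per-step reward signal changes, the output of such a function could be handed to an unmodified RL algorithm in place of the original rewards, which mirrors the abstract's claim that the approach sits on top of any RL algorithm.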


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Decision Transformer: Reinforcement Learning via Sequence Modeling
cs.LG · 2021-06 · accept · novelty 8.0

Decision Transformer casts RL as autoregressive sequence modeling conditioned on desired returns, past states and actions, matching or exceeding offline RL baselines on Atari, Gym and Key-to-Door tasks.