pith. sign in

arxiv: 1710.05090 · v1 · pith:S3E6YHHUnew · submitted 2017-10-13 · 💻 cs.LG · stat.ML

Burn-In Demonstrations for Multi-Modal Imitation Learning

classification 💻 cs.LG stat.ML
keywords learningimitationpoliciesexpertmulti-modaltimeapproachbehavior
0
0 comments X
read the original abstract

Recent work on imitation learning has generated policies that reproduce expert behavior from multi-modal data. However, past approaches have focused only on recreating a small number of distinct, expert maneuvers, or have relied on supervised learning techniques that produce unstable policies. This work extends InfoGAIL, an algorithm for multi-modal imitation learning, to reproduce behavior over an extended period of time. Our approach involves reformulating the typical imitation learning setting to include "burn-in demonstrations" upon which policies are conditioned at test time. We demonstrate that our approach outperforms standard InfoGAIL in maximizing the mutual information between predicted and unseen style labels in road scene simulations, and we show that our method leads to policies that imitate expert autonomous driving systems over long time horizons.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.