Wield: Systematic Reinforcement Learning With Progressive Randomization

Eiko Yoneki; Kai Fricke; Michael Schaarschmidt

arxiv: 1909.06844 · v1 · pith:4JWRWFOWnew · submitted 2019-09-15 · 💻 cs.LG · stat.ML




keywords wield · design · learning · reinforcement · task · randomization · abstractions · action



Reinforcement learning frameworks have introduced abstractions to implement and execute algorithms at scale. They assume standardized simulator interfaces but are not concerned with identifying suitable task representations. We present Wield, a first-of-its-kind system to facilitate task design for practical reinforcement learning. Through software primitives, Wield enables practitioners to decouple system-interface and deployment-specific configuration from state and action design. To guide experimentation, Wield further introduces a novel task design protocol and classification scheme centred around staged randomization to incrementally evaluate model capabilities.


